Poster

ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision

Wonjae Kim · Bokyung Son · Ildoo Kim
2021 Poster
