Skip to yearly menu bar Skip to main content


Oral

Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding

Kenton Lee · Mandar Joshi · Iulia Turc · Hexiang Hu · Fangyu Liu · Julian M Eisenschlos · Urvashi Khandelwal · Peter Shaw · Ming-Wei Chang · Kristina Toutanova
2023 Oral
[ PDF

Abstract

Video

Chat is not available.