Skip to yearly menu bar Skip to main content


Oral

Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding

Kenton Lee ⋅ Mandar Joshi ⋅ Iulia Turc ⋅ Hexiang Hu ⋅ Fangyu Liu ⋅ Julian M Eisenschlos ⋅ Urvashi Khandelwal ⋅ Peter Shaw ⋅ Ming-Wei Chang ⋅ Kristina Toutanova
2023 Oral
[ PDF

Abstract

Video

Chat is not available.