Skip to yearly menu bar Skip to main content


Poster

Reason with Thumbnails, Answer with Focus: An Efficient and Effective Paradigm for Multimodal Grounded Visual Reasoning

An-Lan Wang ⋅ Guozhi Tang ⋅ Lei Liao ⋅ Hanshen Zhu ⋅ Kai Huang ⋅ Jingqun Tang ⋅ Jiaming Zhou ⋅ Kun-Yu Lin

Abstract

Log in and register to view live content