Skip to yearly menu bar Skip to main content


Poster

Do Vision and Text Cues Exhibit Evidential Coupling? UFO: A Benchmark for Compositional Multimodal Reasoning in Unified Models

Zhongyu Yang ⋅ Dannong Xu ⋅ Yonghan Zhang ⋅ Kefan Chen ⋅ Xinyi Wang ⋅ Yang Xu ⋅ Wei Pang ⋅ Yingfang Yuan

Abstract

Log in and register to view live content