Skip to yearly menu bar Skip to main content


Poster

POLIA: Policy Optimization with Visual-Object-Level Intrinsic Advantage for Multimodal Reasoning

Yiran Zeng ⋅ Da Chen ⋅ Hangyu Mao ⋅ Yuanxing Zhang ⋅ Pengfei Wan ⋅ Mengchen Zhao

Abstract

Log in and register to view live content