Poster

AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in Unified Multimodal Models via Decompositional Verifiable Reward

Runhui Huang ⋅ Jie Wu ⋅ Rui Yang ⋅ Zhe Liu ⋅ Hengshuang Zhao

Abstract
