Poster

UDM-GRPO: Stable and Efficient Group Relative Policy Optimization for Uniform Discrete Diffusion Models

Jiaqi Wang ⋅ Haoge Deng ⋅ Ting Pan ⋅ Yang Liu ⋅ Chengyuan Wang ⋅ Fan Zhang ⋅ Yonggang Qi ⋅ Xinlong Wang

Abstract
