Skip to yearly menu bar Skip to main content


Poster

dTRPO : Trajectory Reduction in Policy Optimization of Diffusion Large Language Models

Wenxuan Zhang ⋅ Lemeng Wu ⋅ Changsheng Zhao ⋅ Ernie Chang ⋅ Mingchen Zhuge ⋅ Zechun Liu ⋅ Andy (DiJia) Su ⋅ Hanxian Huang ⋅ Jun Chen ⋅ Chong Zhou ⋅ Raghuraman Krishnamoorthi ⋅ Vikas Chandra ⋅ Mohamed Elhoseiny ⋅ Wei Wen

Abstract

Log in and register to view live content