Skip to yearly menu bar Skip to main content


Poster

DARTS: Distribution-Aware Active Rollout Trajectory Shaping for Accelerating LLM Reinforcement Learning

Yujie Wang ⋅ Siwei Chen ⋅ Longzan Luo ⋅ Xinyi Liu ⋅ Xupeng Miao ⋅ Fangcheng Fu ⋅ Bin Cui

Abstract

Log in and register to view live content