Poster

DRIVE: Best Data Scheduling Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation

Speed Zhu ⋅ Chuheng Zhang ⋅ Jianwei Cai ⋅ Guang Chen ⋅ Lulu Wu ⋅ Xiaolong Xu ⋅ Xuyun Zhang ⋅ Saiyong Yang ⋅ Wiggin Zhou

Abstract
