Skip to yearly menu bar Skip to main content


Poster Tue, Jul 7, 2026 • 10:30 PM – 12:15 AM PDT HALL A #3900

DRIVE: Best Data Scheduling Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation

Speed Zhu ⋅ Chuheng Zhang ⋅ Jianwei Cai ⋅ Guang Chen ⋅ Lulu Wu ⋅ Xiaolong Xu ⋅ Xuyun Zhang ⋅ Saiyong Yang ⋅ Wiggin Zhou

Abstract

Log in and register to view live content