Skip to yearly menu bar Skip to main content


Poster

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Zhiyuan Zeng ⋅ Hamish Ivison ⋅ Yiping Wang ⋅ Lifan Yuan ⋅ Stella Li ⋅ Zhuorui Ye ⋅ Siting Li ⋅ Jacqueline He ⋅ Runlong Zhou ⋅ Tong Chen ⋅ Chenyang Zhao ⋅ Yulia Tsvetkov ⋅ Simon Du ⋅ Natasha Jaques ⋅ Hao Peng ⋅ Pang Wei Koh ⋅ Hannaneh Hajishirzi

Abstract

Log in and register to view live content