Skip to yearly menu bar Skip to main content


Poster

Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration

Zhicheng Yang ⋅ Zhijiang Guo ⋅ Yinya Huang ⋅ Yongxin Wang ⋅ Dongchun Xie ⋅ Hanhui Li ⋅ Yiwei Wang ⋅ Xiaodan Liang ⋅ Jing Tang

Abstract

Log in and register to view live content