Skip to yearly menu bar Skip to main content


Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Yang Yue ⋅ Zhiqi Chen ⋅ Rui Lu ⋅ Andrew Zhao ⋅ Zhaokai Wang ⋅ Yang Yue ⋅ Shiji Song ⋅ Gao Huang

Abstract

Video

Chat is not available.