Skip to yearly menu bar Skip to main content


Poster

ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning

Zihan Lin ⋅ Xiaohan Wang ⋅ Jie Cao ⋅ Jiajun Chai ⋅ Li Wang ⋅ Xiaodong Lu ⋅ Wei Lin ⋅ Ran He ⋅ Guojun Yin

Abstract

Log in and register to view live content