

Poster

Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning

Yixuan Xu ⋅ Yash Savani ⋅ Fei Fang ⋅ Zico Kolter

Abstract
