Poster in Workshop: RLxF: RL from World Feedback

Transferability for General Reasoning: An Automated Curriculum for Multi-Domain LLM RL

Yongjin Yang ⋅ Jiarui Liu ⋅ Yinghui He ⋅ Lechen Zhang ⋅ Bernhard Schölkopf ⋅ Zhijing Jin

Abstract
