Poster

One-Way Policy Optimization for Self-Evolving LLMs

Shuo Yang ⋅ Jinda Lu ⋅ Kexin Huang ⋅ Chiyu Ma ⋅ Shaohang Wei ⋅ Yuyang Liu ⋅ Guoyin Wang ⋅ Jingren Zhou ⋅ Li Yuan

Abstract
