Skip to yearly menu bar Skip to main content


Poster Thu, Jul 9, 2026 • 1:00 AM – 2:45 AM PDT HALL A #1808

One-Way Policy Optimization for Self-Evolving LLMs

Shuo Yang ⋅ Jinda Lu ⋅ Kexin Huang ⋅ Chiyu Ma ⋅ Shaohang Wei ⋅ Yuyang Liu ⋅ Guoyin Wang ⋅ Jingren Zhou ⋅ Li Yuan

Abstract

Log in and register to view live content