Skip to yearly menu bar Skip to main content


Poster Wed, Jul 8, 2026 • 2:30 PM – 4:15 PM KST HALL A

CPMöbius: Iterative Coach–Player Reasoning for Data-Free Reinforcement Learning

Ran Li ⋅ Zeyuan Liu ⋅ Yinghao Chen ⋅ Bingxiang He ⋅ Jiarui Yuan ⋅ Zixuan Fu ⋅ Weize Chen ⋅ Jinyi Hu ⋅ Chen Qian ⋅ Zhiyuan Liu ⋅ Maosong Sun

Abstract

Log in and register to view live content