Skip to yearly menu bar Skip to main content


Poster

CPMöbius: Iterative Coach–Player Reasoning for Data-Free Reinforcement Learning

Ran Li ⋅ Zeyuan Liu ⋅ Yinghao Chen ⋅ Bingxiang He ⋅ Jiarui Yuan ⋅ Zixuan Fu ⋅ Weize Chen ⋅ Jinyi Hu ⋅ Chen Qian ⋅ Zhiyuan Liu ⋅ Maosong Sun

Abstract

Log in and register to view live content