

Poster

From Shortcuts to Reasoning: Robust Post-Training of Theory of Mind with Reinforcement Learning

Jike Zhong ⋅ Yuxiang Lai ⋅ Ming Li ⋅ Yuheng Li ⋅ Wuao Liu ⋅ Behzad Dariush ⋅ Konstantinos Psounis ⋅ Shao-Yuan Lo

Abstract
