Timezone: »
Theory of mind, the ability to model others' thoughts and desires, is a cornerstone of human social intelligence. This makes it an important challenge for the machine learning community, but previous works mainly attempt to design agents that model the "mental state" of others as passive observers or in specific predefined roles, such as in speaker-listener scenarios. In contrast, we propose to model machine theory of mind in a more general symmetric scenario. We introduce a multi-agent environment SymmToM where, like in real life, all agents can speak, listen, see other agents, and move freely through the world. Effective strategies to maximize an agent's reward require it to develop a theory of mind. We show that reinforcement learning agents that model the mental states of others achieve significant performance improvements over agents with no such theory of mind model. Importantly, our best agents still fail to achieve performance comparable to agents with access to the gold-standard mental state of other agents, demonstrating that the modeling of theory of mind in multi-agent scenarios is very much an open challenge.
Author Information
Melanie Sclar (University of Washington)
Graham Neubig (Carnegie Mellon University)
Yonatan Bisk (Carnegie Mellon University)
Related Events (a corresponding poster, oral, or spotlight)
-
2022 Spotlight: Symmetric Machine Theory of Mind »
Wed. Jul 20th 09:35 -- 09:40 PM Room Hall F
More from the Same Authors
-
2023 : Adapting to Gradual Distribution Shifts with Continual Weight Averaging »
Jared Fernandez · Saujas Vaduguru · Sanket Vaibhav Mehta · Yonatan Bisk · Emma Strubell -
2023 : Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker »
Melanie Sclar · Sachin Kumar · Peter West · Alane Suhr · Yejin Choi · Yulia Tsvetkov -
2023 : Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker »
Melanie Sclar · Sachin Kumar · Peter West · Alane Suhr · Yejin Choi · Yulia Tsvetkov -
2023 Oral: Cross-Modal Fine-Tuning: Align then Refine »
Junhong Shen · Liam Li · Lucio Dery · Corey Staten · Mikhail Khodak · Graham Neubig · Ameet Talwalkar -
2023 Poster: Cross-Modal Fine-Tuning: Align then Refine »
Junhong Shen · Liam Li · Lucio Dery · Corey Staten · Mikhail Khodak · Graham Neubig · Ameet Talwalkar -
2023 Poster: PAL: Program-aided Language Models »
Luyu Gao · Aman Madaan · Shuyan Zhou · Uri Alon · Pengfei Liu · Yiming Yang · Jamie Callan · Graham Neubig -
2023 Poster: Why do Nearest Neighbor Language Models Work? »
Frank Xu · Uri Alon · Graham Neubig -
2022 Poster: Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval »
Uri Alon · Frank Xu · Junxian He · Sudipta Sengupta · Dan Roth · Graham Neubig -
2022 Poster: A Framework for Learning to Request Rich and Contextually Useful Information from Humans »
Khanh Nguyen · Yonatan Bisk · Hal Daumé III -
2022 Spotlight: A Framework for Learning to Request Rich and Contextually Useful Information from Humans »
Khanh Nguyen · Yonatan Bisk · Hal Daumé III -
2022 Spotlight: Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval »
Uri Alon · Frank Xu · Junxian He · Sudipta Sengupta · Dan Roth · Graham Neubig -
2021 Poster: Examining and Combating Spurious Features under Distribution Shift »
Chunting Zhou · Xuezhe Ma · Paul Michel · Graham Neubig -
2021 Poster: Few-shot Language Coordination by Modeling Theory of Mind »
Hao Zhu · Graham Neubig · Yonatan Bisk -
2021 Spotlight: Few-shot Language Coordination by Modeling Theory of Mind »
Hao Zhu · Graham Neubig · Yonatan Bisk -
2021 Spotlight: Examining and Combating Spurious Features under Distribution Shift »
Chunting Zhou · Xuezhe Ma · Paul Michel · Graham Neubig -
2020 Poster: Optimizing Data Usage via Differentiable Rewards »
Xinyi Wang · Hieu Pham · Paul Michel · Antonios Anastasopoulos · Jaime Carbonell · Graham Neubig -
2020 Poster: XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalisation »
Junjie Hu · Sebastian Ruder · Aditya Siddhant · Graham Neubig · Orhan Firat · Melvin Johnson