Timezone: »
Poster
Modeling Others using Oneself in Multi-Agent Reinforcement Learning
Roberta Raileanu · Emily Denton · Arthur Szlam · Facebook Rob Fergus
We consider the multi-agent reinforcement learning setting with imperfect information. The reward function depends on the hidden goals of both agents, so the agents must infer the other players’ goals from their observed behavior in order to maximize their returns. We propose a new approach for learning in these domains: Self Other-Modeling (SOM), in which an agent uses its own policy to predict the other agent’s actions and update its belief of their hidden goal in an online manner. We evaluate this approach on three different tasks and show that the agents are able to learn better policies using their estimate of the other players’ goals, in both cooperative and competitive settings.
Author Information
Roberta Raileanu (NYU)
Emily Denton (New York University)
Arthur Szlam (Facebook AI Research)
Facebook Rob Fergus (Facebook AI Research, NYU)
Related Events (a corresponding poster, oral, or spotlight)
-
2018 Oral: Modeling Others using Oneself in Multi-Agent Reinforcement Learning »
Thu. Jul 12th 09:40 -- 09:50 AM Room A3
More from the Same Authors
-
2021 Oral: Decoupling Value and Policy for Generalization in Reinforcement Learning »
Roberta Raileanu · Rob Fergus -
2021 Poster: Decoupling Value and Policy for Generalization in Reinforcement Learning »
Roberta Raileanu · Rob Fergus -
2020 : Automatic Data Augmentation for Generalization in Reinforcement Learning »
Roberta Raileanu -
2020 Poster: Fast Adaptation to New Environments via Policy-Dynamics Value Functions »
Roberta Raileanu · Max Goldstein · Arthur Szlam · Facebook Rob Fergus -
2019 Workshop: Workshop on Multi-Task and Lifelong Reinforcement Learning »
Sarath Chandar · Shagun Sodhani · Khimya Khetarpal · Tom Zahavy · Daniel J. Mankowitz · Shie Mannor · Balaraman Ravindran · Doina Precup · Chelsea Finn · Abhishek Gupta · Amy Zhang · Kyunghyun Cho · Andrei A Rusu · Facebook Rob Fergus -
2018 Poster: Stochastic Video Generation with a Learned Prior »
Emily Denton · Rob Fergus -
2018 Oral: Stochastic Video Generation with a Learned Prior »
Emily Denton · Rob Fergus -
2018 Poster: Composable Planning with Attributes »
Amy Zhang · Sainbayar Sukhbaatar · Adam Lerer · Arthur Szlam · Facebook Rob Fergus -
2018 Oral: Composable Planning with Attributes »
Amy Zhang · Sainbayar Sukhbaatar · Adam Lerer · Arthur Szlam · Facebook Rob Fergus