Timezone: »
Oral
Modeling Others using Oneself in Multi-Agent Reinforcement Learning
Roberta Raileanu · Emily Denton · Arthur Szlam · Facebook Rob Fergus
We consider the multi-agent reinforcement learningsetting with imperfect information. The rewardfunction depends on the hidden goals ofboth agents, so the agents must infer the otherplayers’ goals from their observed behavior inorder to maximize their returns. We propose anew approach for learning in these domains: SelfOther-Modeling (SOM), in which an agent usesits own policy to predict the other agent’s actionsand update its belief of their hidden goal in an onlinemanner. We evaluate this approach on threedifferent tasks and show that the agents are ableto learn better policies using their estimate of theother players’ goals, in both cooperative and competitivesettings.
Author Information
Roberta Raileanu (NYU)
Emily Denton (New York University)
Arthur Szlam (Facebook AI Research)
Facebook Rob Fergus (Facebook AI Research, NYU)
Related Events (a corresponding poster, oral, or spotlight)
-
2018 Poster: Modeling Others using Oneself in Multi-Agent Reinforcement Learning »
Thu. Jul 12th 04:15 -- 07:00 PM Room Hall B #136
More from the Same Authors
-
2021 Oral: Decoupling Value and Policy for Generalization in Reinforcement Learning »
Roberta Raileanu · Rob Fergus -
2021 Poster: Decoupling Value and Policy for Generalization in Reinforcement Learning »
Roberta Raileanu · Rob Fergus -
2020 : Automatic Data Augmentation for Generalization in Reinforcement Learning »
Roberta Raileanu -
2020 Poster: Fast Adaptation to New Environments via Policy-Dynamics Value Functions »
Roberta Raileanu · Max Goldstein · Arthur Szlam · Facebook Rob Fergus -
2019 Workshop: Workshop on Multi-Task and Lifelong Reinforcement Learning »
Sarath Chandar · Shagun Sodhani · Khimya Khetarpal · Tom Zahavy · Daniel J. Mankowitz · Shie Mannor · Balaraman Ravindran · Doina Precup · Chelsea Finn · Abhishek Gupta · Amy Zhang · Kyunghyun Cho · Andrei A Rusu · Facebook Rob Fergus -
2018 Poster: Stochastic Video Generation with a Learned Prior »
Emily Denton · Rob Fergus -
2018 Oral: Stochastic Video Generation with a Learned Prior »
Emily Denton · Rob Fergus -
2018 Poster: Composable Planning with Attributes »
Amy Zhang · Sainbayar Sukhbaatar · Adam Lerer · Arthur Szlam · Facebook Rob Fergus -
2018 Oral: Composable Planning with Attributes »
Amy Zhang · Sainbayar Sukhbaatar · Adam Lerer · Arthur Szlam · Facebook Rob Fergus