Timezone: »
Individuality is essential in human society. It induces the division of labor and thus improves the efficiency and productivity. Similarly, it should also be a key to multi-agent cooperation. Inspired by that individuality is of being an individual separate from others, we propose a simple yet efficient method for the emergence of individuality (EOI) in multi-agent reinforcement learning (MARL). EOI learns a probabilistic classifier that predicts a probability distribution over agents given their observation and gives each agent an intrinsic reward of being correctly predicted by the classifier. The intrinsic reward encourages the agents to visit their own familiar observations, and learning the classifier by such observations makes the intrinsic reward signals stronger and in turn makes the agents more identifiable. To further enhance the intrinsic reward and promote the emergence of individuality, two regularizers are proposed to increase the discriminability of the classifier. We implement EOI on top of popular MARL algorithms. Empirically, we show that EOI outperforms existing methods in a variety of multi-agent cooperative scenarios.
Author Information
Jiechuan Jiang (Peking University)
Zongqing Lu (Peking University)
Related Events (a corresponding poster, oral, or spotlight)
-
2021 Poster: The Emergence of Individuality »
Wed. Jul 21st 04:00 -- 06:00 AM Room
More from the Same Authors
-
2022 Poster: Divergence-Regularized Multi-Agent Actor-Critic »
Kefan Su · Zongqing Lu -
2022 Poster: Difference Advantage Estimation for Multi-Agent Policy Gradients »
yueheng li · Guangming Xie · Zongqing Lu -
2022 Poster: Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning »
Haoqi Yuan · Zongqing Lu -
2022 Spotlight: Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning »
Haoqi Yuan · Zongqing Lu -
2022 Spotlight: Divergence-Regularized Multi-Agent Actor-Critic »
Kefan Su · Zongqing Lu -
2022 Spotlight: Difference Advantage Estimation for Multi-Agent Policy Gradients »
yueheng li · Guangming Xie · Zongqing Lu -
2021 : RL + Operations Research Panel »
Jim Dai · Fei Fang · Shie Mannor · Yuandong Tian · Zhiwei (Tony) Qin · Zongqing Lu -
2021 Workshop: Reinforcement Learning for Real Life »
Yuxi Li · Minmin Chen · Omer Gottesman · Lihong Li · Zongqing Lu · Rupam Mahmood · Niranjani Prasad · Zhiwei (Tony) Qin · Csaba Szepesvari · Matthew Taylor -
2021 Poster: FOP: Factorizing Optimal Joint Policy of Maximum-Entropy Multi-Agent Reinforcement Learning »
Tianhao Zhang · yueheng li · Chen Wang · Guangming Xie · Zongqing Lu -
2021 Spotlight: FOP: Factorizing Optimal Joint Policy of Maximum-Entropy Multi-Agent Reinforcement Learning »
Tianhao Zhang · yueheng li · Chen Wang · Guangming Xie · Zongqing Lu