Timezone: »
In this paper, we consider cooperative multi-agent reinforcement learning (MARL) with sparse reward. To tackle this problem, we propose a novel method named MASER: MARL with subgoals generated from experience replay buffer. Under the widely-used assumption of centralized training with decentralized execution and consistent Q-value decomposition for MARL, MASER automatically generates proper subgoals for multiple agents from the experience replay buffer by considering both individual Q-value and total Q-value. Then, MASER designs individual intrinsic reward for each agent based on actionable representation relevant to Q-learning so that the agents reach their sub-goals while maximizing the joint action value. Numerical results show that MASER significantly outperforms StarCraft II micromanagement benchmark compared to other state-of-the-art MARL algorithms.
Author Information
JEON JEEWON (Korea Advanced Institute of Science and Technology)
WOOJUN KIM (KAIST)
Whiyoung Jung (KAIST)
Youngchul Sung (KAIST)
Related Events (a corresponding poster, oral, or spotlight)
-
2022 Spotlight: MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer »
Wed. Jul 20th 05:55 -- 06:00 PM Room None
More from the Same Authors
-
2022 : An Adaptive Entropy-Regularization Framework for Multi-Agent Reinforcement Learning »
WOOJUN KIM · Youngchul Sung -
2022 : A Variational Approach to Mutual Information-Based Coordination for Multi-Agent Reinforcement Learning »
WOOJUN KIM -
2022 Poster: Robust Imitation Learning against Variations in Environment Dynamics »
Jongseong Chae · Seungyul Han · Whiyoung Jung · MYUNG-SIK CHO · Sungho Choi · Youngchul Sung -
2022 Spotlight: Robust Imitation Learning against Variations in Environment Dynamics »
Jongseong Chae · Seungyul Han · Whiyoung Jung · MYUNG-SIK CHO · Sungho Choi · Youngchul Sung -
2021 Poster: Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration »
Seungyul Han · Youngchul Sung -
2021 Spotlight: Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration »
Seungyul Han · Youngchul Sung -
2019 Poster: Dimension-Wise Importance Sampling Weight Clipping for Sample-Efficient Reinforcement Learning »
Seungyul Han · Youngchul Sung -
2019 Oral: Dimension-Wise Importance Sampling Weight Clipping for Sample-Efficient Reinforcement Learning »
Seungyul Han · Youngchul Sung