Sharing parameters in multi-agent deep reinforcement learning has played an essential role in allowing algorithms to scale to a large number of agents. Parameter sharing between agents significantly decreases the number of trainable parameters, shortening training times to tractable levels, and has been linked to more efficient learning. However, having all agents share the same parameters can also have a detrimental effect on learning. We demonstrate the impact of parameter sharing methods on training speed and converged returns, establishing that when applied indiscriminately, their effectiveness is highly dependent on the environment. We propose a novel method to automatically identify agents which may benefit from sharing parameters by partitioning them based on their abilities and goals. Our approach combines the increased sample efficiency of parameter sharing with the representational capacity of multiple independent networks to reduce training time and increase final returns.
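The trade-off described above can be made concrete with a small parameter-count comparison. The following is only a minimal sketch, not the authors' implementation: the abstract does not say how agents are partitioned, so the partition here is supplied by hand, and the names `count_params`, `group_agents`, `total_params`, and the layer sizes are hypothetical choices made for illustration.

```python
from collections import defaultdict


def count_params(layer_sizes):
    """Count weights and biases of a fully connected network."""
    return sum(i * o + o for i, o in zip(layer_sizes, layer_sizes[1:]))


def group_agents(partition):
    """Invert an agent -> group mapping into group -> list of agents.

    Agents in the same group are assumed to share one network.
    """
    groups = defaultdict(list)
    for agent, group in partition.items():
        groups[group].append(agent)
    return dict(groups)


def total_params(partition, layer_sizes):
    """Trainable parameters when each group owns exactly one network."""
    return len(set(partition.values())) * count_params(layer_sizes)


# Hypothetical setup: six agents, each using a 10-64-64-5 network.
layer_sizes = [10, 64, 64, 5]
agents = [f"agent_{i}" for i in range(6)]

no_sharing = {a: a for a in agents}            # one network per agent
full_sharing = {a: "shared" for a in agents}   # one network for everyone
# Assumed partition: the first three agents share one network, the rest another.
selective = {a: ("A" if i < 3 else "B") for i, a in enumerate(agents)}

for name, partition in [("no sharing", no_sharing),
                        ("full sharing", full_sharing),
                        ("selective sharing", selective)]:
    print(name, total_params(partition, layer_sizes))
```

With a hand-made partition like this, selective sharing sits between the two extremes: it trains more parameters than a single shared network but far fewer than one network per agent, while still letting dissimilar groups learn separate policies.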
Author Information
Filippos Christianos (University of Edinburgh)
Georgios Papoudakis (The University of Edinburgh)
Muhammad Arrasy Rahman (The University of Edinburgh)
Stefano V. Albrecht (University of Edinburgh)
Related Events (a corresponding poster, oral, or spotlight)
- 2021 Spotlight: Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing
  Wed. Jul 21st 01:25 -- 01:30 PM
More from the Same Authors
- 2021: Decoupling Exploration and Exploitation in Reinforcement Learning
  Lukas Schäfer · Filippos Christianos · Josiah Hanna · Stefano V. Albrecht
- 2021 Poster: Towards Open Ad Hoc Teamwork Using Graph-based Policy Learning
  Muhammad Arrasy Rahman · Niklas Hopner · Filippos Christianos · Stefano V. Albrecht
- 2021 Spotlight: Towards Open Ad Hoc Teamwork Using Graph-based Policy Learning
  Muhammad Arrasy Rahman · Niklas Hopner · Filippos Christianos · Stefano V. Albrecht