Timezone: »
The ever-growing complexity of reinforcement learning (RL) tasks demands a distributed system to train intelligent agents by efficiently producing and processing a massive amount of data. In this paper, we propose a more comprehensive computational abstraction for RL training tasks and introduce a general, scalable, and efficient RL system called Really Scalable RL (SRL), featuring a novel architecture that separates three major computation components in RL training. Our evaluation demonstrates that SRL outperforms a popular open-source RL system RLlib RLlib (Liang et al., 2017) in training throughput. Moreover, to assess the learning performance of SRL, we have conducted a benchmark on a large scale cluster with 32 Nvidia A100 GPUs, 64 Nvidia RTX 3090 GPUs and more than 10000 CPU cores, reproducing the results of industrial production system from OpenAI, Rapid (Berner et al., 2019) in the hide and-seek environment (Baker et al., 2019). The results show that SRL is capable of achieving up to 5 times training speedup compared to published results in Baker et al. (2019).
Author Information
Zhiyu Mei (Tsinghua University, Tsinghua University)
Wei Fu (Tsinghua University)
Guangju Wang
Huanchen Zhang
Yi Wu (Tsinghua University & Shanghai Qi Zhi Institute)
More from the Same Authors
-
2021 : Disentangled Attention as Intrinsic Regularization for Bimanual Multi-Object Manipulation »
Minghao Zhang · Pingcheng Jian · Yi Wu · Harry (Huazhe) Xu · Xiaolong Wang -
2022 : Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning »
Zhecheng Yuan · Zhecheng Yuan · Zhengrong Xue · Zhengrong Xue · Bo Yuan · Bo Yuan · Xueqian Wang · Xueqian Wang · Yi Wu · Yi Wu · Yang Gao · Yang Gao · Huazhe Xu · Huazhe Xu -
2022 Poster: Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning »
Yunfei Li · Tian Gao · Jiaqi Yang · Huazhe Xu · Yi Wu -
2022 Spotlight: Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning »
Yunfei Li · Tian Gao · Jiaqi Yang · Huazhe Xu · Yi Wu -
2022 Poster: Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning »
Wei Fu · Chao Yu · Zelai Xu · Jiaqi Yang · Yi Wu -
2022 Spotlight: Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning »
Wei Fu · Chao Yu · Zelai Xu · Jiaqi Yang · Yi Wu -
2018 Poster: Discrete-Continuous Mixtures in Probabilistic Programming: Generalized Semantics and Inference Algorithms »
Yi Wu · Siddharth Srivastava · Nicholas Hay · Simon Du · Stuart Russell -
2018 Oral: Discrete-Continuous Mixtures in Probabilistic Programming: Generalized Semantics and Inference Algorithms »
Yi Wu · Siddharth Srivastava · Nicholas Hay · Simon Du · Stuart Russell