Timezone: »
The application of an ensemble of neural networks is becoming an imminent tool for advancing state-of-the-art deep reinforcement learning algorithms. However, training these large numbers of neural networks in the ensemble has anexceedingly high computation cost which may become a hindrance in training large-scale systems. In this paper, we propose DNS: a DeterminantalPoint Process based Neural Network Sampler that specifically uses k-DPP to sample a subset of neural networks for backpropagation at every training step thus significantly reducing the training time and computation cost. We integrated DNS in REDQ for continuous control tasks and evaluated on MuJoCo environments. Our experiments show that DNS augmented REDQ matches the baseline REDQ in terms of average cumulative reward and achieves this using less than 50% computation when measured in FLOPS. The code is available at https://github.com/IntelLabs/DNS
Author Information
Hassam Sheikh (Intel Labs)
Kizza Nandyose Frisbee (Intel Corporation)
mariano phielipp (Intel AI Labs)
Related Events (a corresponding poster, oral, or spotlight)
-
2022 Poster: DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning »
Wed. Jul 20th through Thu the 21st Room Hall E #819
More from the Same Authors
-
2021 : The Reflective Explorer: Online Meta-Exploration from Offline Data in Visual Tasks with Sparse Rewards »
Rafael Rafailov · Varun Kumar · Tianhe (Kevin) Yu · Avi Singh · mariano phielipp · Chelsea Finn -
2022 Poster: AnyMorph: Learning Transferable Polices By Inferring Agent Morphology »
Brandon Trabucco · mariano phielipp · Glen Berseth -
2022 Spotlight: AnyMorph: Learning Transferable Polices By Inferring Agent Morphology »
Brandon Trabucco · mariano phielipp · Glen Berseth