Timezone: »
Since the introduction of DQN, a vast majority of reinforcement learning research has focused on reinforcement learning with deep neural networks as function approximators. New methods are typically evaluated on a set of environments that have now become standard, such as Atari 2600 games. While these benchmarks help standardize evaluation, their computational cost has the unfortunate side effect of widening the gap between those with ample access to computational resources, and those without. In this work we argue that, despite the community’s emphasis on large-scale environments, the traditional small-scale environments can still yield valuable scientific insights and can help reduce the barriers to entry for underprivileged communities. To substantiate our claims, we empirically revisit the paper which introduced the Rainbow algorithm [Hessel et al., 2018] and present some new insights into the algorithms used by Rainbow.
Author Information
Johan Obando Ceron (UAO)
Pablo Samuel Castro (Google Brain)
Pablo was born and raised in Quito, Ecuador, and moved to Montreal after high school to study at McGill. He stayed in Montreal for the next 10 years, finished his bachelors, worked at a flight simulator company, and then eventually obtained his masters and PhD at McGill, focusing on Reinforcement Learning. After his PhD Pablo did a 10-month postdoc in Paris before moving to Pittsburgh to join Google. He has worked at Google for almost 6 years, and is currently a research Software Engineer in Google Brain in Montreal, focusing on fundamental Reinforcement Learning research, as well as Machine Learning and Music. Aside from his interest in coding/AI/math, Pablo is an active musician (https://www.psctrio.com), loves running (5 marathons so far, including Boston!), and discussing politics and activism.
Related Events (a corresponding poster, oral, or spotlight)
-
2021 Poster: Revisiting Rainbow: Promoting more insightful and inclusive deep reinforcement learning research »
Tue. Jul 20th 04:00 -- 06:00 PM Room
More from the Same Authors
-
2021 : A functional mirror ascent view of policy gradient methods with function approximation »
Sharan Vaswani · Olivier Bachem · Simone Totaro · Matthieu Geist · Marlos C. Machado · Pablo Samuel Castro · Nicolas Le Roux -
2023 Poster: The Dormant Neuron Phenomenon in Deep Reinforcement Learning »
Ghada Sokar · Rishabh Agarwal · Pablo Samuel Castro · Utku Evci -
2023 Oral: The Dormant Neuron Phenomenon in Deep Reinforcement Learning »
Ghada Sokar · Rishabh Agarwal · Pablo Samuel Castro · Utku Evci -
2023 Poster: Bigger, Better, Faster: Human-level Atari with human-level efficiency »
Max Schwarzer · Johan Obando Ceron · Aaron Courville · Marc Bellemare · Rishabh Agarwal · Pablo Samuel Castro -
2022 : Estimating Policy Functions in Payments Systems Using Reinforcement Learning »
Pablo Samuel Castro -
2022 Poster: The State of Sparse Training in Deep Reinforcement Learning »
Laura Graesser · Utku Evci · Erich Elsen · Pablo Samuel Castro -
2022 Spotlight: The State of Sparse Training in Deep Reinforcement Learning »
Laura Graesser · Utku Evci · Erich Elsen · Pablo Samuel Castro -
2020 Poster: Rigging the Lottery: Making All Tickets Winners »
Utku Evci · Trevor Gale · Jacob Menick · Pablo Samuel Castro · Erich Elsen