50 Results

Poster
Tue 7:00 What Can Learned Intrinsic Rewards Capture?
Zeyu Zheng, Junhyuk Oh, Matteo Hessel, Zhongwen Xu, Manuel Kroiss, Hado van Hasselt, David Silver, Satinder Singh
Poster
Tue 7:00 Taylor Expansion Policy Optimization
Yunhao Tang, Michal Valko, Remi Munos
Poster
Tue 8:00 Working Memory Graphs
Ricky Loynd, Roland Fernandez, Asli Celikyilmaz, Adith Swaminathan, Matthew Hausknecht
Poster
Tue 8:00 Learning Fair Policies in Multi-Objective (Deep) Reinforcement Learning with Average and Discounted Rewards
Umer Siddique, Paul Weng, Matthieu Zimmer
Poster
Tue 9:00 Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination
Somdeb Majumdar, Shauharda Khadka, Santiago Miret, Stephen McAleer, Kagan Tumer
Poster
Tue 9:00 Variational Imitation Learning with Diverse-quality Demonstrations
Voot Tangkaratt, Bo Han, Emti Khan, Masashi Sugiyama
Poster
Tue 9:00 Hierarchically Decoupled Imitation For Morphological Transfer
Donald Hejna, Lerrel Pinto, Pieter Abbeel
Poster
Tue 9:00 Learning Compound Tasks without Task-specific Knowledge via Imitation and Self-supervised Learning
Sang-Hyun Lee, Seung-Woo Seo
Poster
Tue 9:00 Enhanced POET: Open-ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions
Rui Wang, Joel Lehman, Aditya Rawal, Jiale Zhi, Yulun Li, Jeffrey Clune, Ken Stanley
Poster
Tue 9:00 Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences
Daniel Brown, Russell Coleman, Ravi Srinivasan, Scott Niekum
Poster
Tue 10:00 Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Zhaohan Guo, Bernardo Avila Pires, Bilal Piot, Jean-Bastien Grill, Florent Altché, Remi Munos, Mohammad Gheshlaghi Azar
Poster
Tue 10:00 Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Adam Stooke, Joshua Achiam, Pieter Abbeel
Poster
Tue 11:00 Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills
Victor Campos, Alexander Trott, Caiming Xiong, Richard Socher, Xavier Giro-i-Nieto, Jordi Torres
Poster
Tue 12:00 A Distributional View on Multi-Objective Policy Optimization
Abbas Abdolmaleki, Sandy Huang, Leonard Hasenclever, Michael Neunert, Francis Song, Martina Zambelli, Murilo Martins, Nicolas Heess, Raia Hadsell, Martin Riedmiller
Poster
Tue 12:00 Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics
Arsenii Kuznetsov, Pavel Shvechikov, Alexander Grishin, Dmitry Vetrov
Poster
Tue 12:00 Ready Policy One: World Building Through Active Learning
Philip Ball, Jack Parker-Holder, Aldo Pacchiano, Krzysztof Choromanski, Stephen Roberts
Poster
Tue 13:00 Generalization to New Actions in Reinforcement Learning
Ayush Jain, Andrew Szot, Joseph Lim
Poster
Tue 13:00 Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning
Aleksei Petrenko, Zhehui Huang, Tushar Kumar, Gaurav Sukhatme, Vladlen Koltun
Poster
Tue 13:00 Growing Action Spaces
Gregory Farquhar, Laura Gustafson, Zeming Lin, Shimon Whiteson, Nicolas Usunier, Gabriel Synnaeve
Poster
Tue 14:00 OPtions as REsponses: Grounding behavioural hierarchies in multi-agent reinforcement learning
Sasha Vezhnevets, Yuhuai Wu, Maria Eckstein, Rémi Leblond, Joel Z Leibo
Poster
Tue 18:00 Striving for Simplicity and Performance in Off-Policy DRL: Output Normalization and Non-Uniform Sampling
Che Wang, Yanqiu Wu, Quan Vuong, Keith Ross
Poster
Tue 18:00 Intrinsic Reward Driven Imitation Learning via Generative Model
Xingrui Yu, Yueming Lyu, Ivor Tsang
Poster
Tue 18:00 Can Increasing Input Dimensionality Improve Deep Reinforcement Learning?
Kei Ota, Tomoaki Oiki, Devesh Jha, Toshisada Mariyama, Daniel Nikovski
Poster
Wed 5:00 Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control
Jie Xu, Yunsheng Tian, Pingchuan Ma, Daniela Rus, Shinjiro Sueda, Wojciech Matusik
Poster
Wed 5:00 Learning What to Defer for Maximum Independent Sets
Sungsoo Ahn, Younggyo Seo, Jinwoo Shin
Poster
Wed 5:00 Reinforcement Learning for Integer Programming: Learning to Cut
Yunhao Tang, Shipra Agrawal, Yuri Faenza
Poster
Wed 5:00 Implicit Generative Modeling for Efficient Exploration
Neale Ratzlaff, Qinxun Bai, Fuxin Li, Wei Xu
Poster
Wed 5:00 An Optimistic Perspective on Offline Deep Reinforcement Learning
Rishabh Agarwal, Dale Schuurmans, Mohammad Norouzi
Poster
Wed 5:00 Stabilizing Transformers for Reinforcement Learning
Emilio Parisotto, Francis Song, Jack Rae, Razvan Pascanu, Caglar Gulcehre, Siddhant Jayakumar, Max Jaderberg, Raphael Lopez Kaufman, Aidan Clark, Seb Noury, Matthew Botvinick, Nicolas Heess, Raia Hadsell
Poster
Wed 8:00 Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning
Kimin Lee, Younggyo Seo, Seunghyun Lee, Honglak Lee, Jinwoo Shin
Poster
Wed 8:00 Cautious Adaptation For Reinforcement Learning in Safety-Critical Settings
Jesse Zhang, Brian Cheung, Chelsea Finn, Sergey Levine, Dinesh Jayaraman
Poster
Wed 8:00 Planning to Explore via Self-Supervised World Models
Ramanan Sekar, Oleg Rybkin, Kostas Daniilidis, Pieter Abbeel, Danijar Hafner, Deepak Pathak
Poster
Wed 9:00 Leveraging Procedural Generation to Benchmark Reinforcement Learning
Karl Cobbe, Chris Hesse, Jacob Hilton, John Schulman
Poster
Wed 10:00 A Game Theoretic Framework for Model Based Reinforcement Learning
Aravind Rajeswaran, Igor Mordatch, Vikash Kumar
Poster
Wed 11:00 Revisiting Fundamentals of Experience Replay
William Fedus, Prajit Ramachandran, Rishabh Agarwal, Yoshua Bengio, Hugo Larochelle, Mark Rowland, Will Dabney
Poster
Wed 12:00 Deep Coordination Graphs
Wendelin Boehmer, Vitaly Kurin, Shimon Whiteson
Poster
Wed 13:00 Fast Adaptation to New Environments via Policy-Dynamics Value Functions
Roberta Raileanu, Max Goldstein, Arthur Szlam, Rob Fergus
Poster
Wed 13:00 Interference and Generalization in Temporal Difference Learning
Emmanuel Bengio, Joelle Pineau, Doina Precup
Poster
Wed 15:00 Agent57: Outperforming the Atari Human Benchmark
Adrià Puigdomenech Badia, Bilal Piot, Steven Kapturowski, Pablo Sprechmann, Alex Vitvitskyi, Zhaohan Guo, Charles Blundell
Poster
Wed 16:00 Bidirectional Model-based Policy Optimization
Hang Lai, Jian Shen, Weinan Zhang, Yong Yu
Poster
Thu 6:00 Skew-Fit: State-Covering Self-Supervised Reinforcement Learning
Vitchyr Pong, Murtaza Dalal, Steven Lin, Ashvin Nair, Shikhar Bahl, Sergey Levine
Poster
Thu 6:00 Deep Reinforcement Learning with Smooth Policy
Qianli Shen, Yan Li, Haoming Jiang, Zhaoran Wang, Tuo Zhao
Poster
Thu 6:00 Provably Efficient Model-based Policy Adaptation
Yuda Song, Aditi Mavalankar, Wen Sun, Sicun Gao
Poster
Thu 6:00 Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning
Silviu Pitis, Harris Chan, Stephen Zhao, Bradly Stadie, Jimmy Ba
Poster
Thu 7:00 Goal-Aware Prediction: Learning to Model What Matters
Suraj Nair, Silvio Savarese, Chelsea Finn
Poster
Thu 8:00 One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control
Wenlong Huang, Igor Mordatch, Deepak Pathak
Poster
Thu 8:00 Learning Human Objectives by Evaluating Hypothetical Behavior
Siddharth Reddy, Anca Dragan, Sergey Levine, Shane Legg, Jan Leike
Poster
Thu 9:00 CURL: Contrastive Unsupervised Representations for Reinforcement Learning
Michael Laskin, Aravind Srinivas, Pieter Abbeel
Poster
Thu 15:00 Off-Policy Actor-Critic with Shared Experience Replay
Simon Schmitt, Matteo Hessel, Karen Simonyan
Poster
Thu 18:00 Learning Efficient Multi-agent Communication: An Information Bottleneck Approach
Rundong Wang, Xu He, Runsheng Yu, Wei Qiu, Bo An, Zinovi Rabinovich