97 Results

Affinity Workshop
Mon 11:00 Invited Talk: Doina Precup on Building Knowledge for AI Agents with Reinforcement Learning
Doina Precup
Poster
Tue 7:00 Q-value Path Decomposition for Deep Multiagent Reinforcement Learning
Yaodong Yang, Jianye Hao, Guangyong Chen, Hongyao Tang, Yingfeng Chen, Yujing Hu, Changjie Fan, Zhongyu Wei
Poster
Tue 7:00 Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
Xiaotian Hao, Zhaoqing Peng, Yi Ma, Guan Wang, Junqi Jin, Jianye Hao, Shan Chen, Rongquan Bai, Mingzhou Xie, Miao Xu, Zhenzhe Zheng, Chuan Yu, Han Li, Jian Xu, Kun Gai
Poster
Tue 7:00 Taylor Expansion Policy Optimization
Yunhao Tang, Michal Valko, Remi Munos
Poster
Tue 7:00 What Can Learned Intrinsic Rewards Capture?
Zeyu Zheng, Junhyuk Oh, Matteo Hessel, Zhongwen Xu, Manuel Kroiss, Hado van Hasselt, David Silver, Satinder Singh
Poster
Tue 8:00 Representations for Stable Off-Policy Reinforcement Learning
Dibya Ghosh, Marc Bellemare
Poster
Tue 8:00 Working Memory Graphs
Ricky Loynd, Roland Fernandez, Asli Celikyilmaz, Adith Swaminathan, Matthew Hausknecht
Poster
Tue 8:00 GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values
Shangtong Zhang, Bo Liu, Shimon Whiteson
Poster
Tue 8:00 Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit, Ron Meir, Kamil Ciosek
Poster
Tue 8:00 Learning Fair Policies in Multi-Objective (Deep) Reinforcement Learning with Average and Discounted Rewards
Umer Siddique, Paul Weng, Matthieu Zimmer
Poster
Tue 8:00 Batch Reinforcement Learning with Hyperparameter Gradients
Byung-Jun Lee, Jongmin Lee, Peter Vrancx, Dongho Kim, Kee-Eung Kim
Poster
Tue 9:00 Description Based Text Classification with Reinforcement Learning
Duo Chai, Wei Wu, Qinghong Han, Fei Wu, Jiwei Li
Poster
Tue 9:00 Variational Imitation Learning with Diverse-quality Demonstrations
Voot Tangkaratt, Bo Han, Emti Khan, Masashi Sugiyama
Poster
Tue 9:00 Hierarchically Decoupled Imitation For Morphological Transfer
Donald Hejna, Lerrel Pinto, Pieter Abbeel
Poster
Tue 9:00 Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination
Somdeb Majumdar, Shauharda Khadka, Santiago Miret, Stephen McAleer, Kagan Tumer
Poster
Tue 9:00 Enhanced POET: Open-ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions
Rui Wang, Joel Lehman, Aditya Rawal, Jiale Zhi, Yulun Li, Jeffrey Clune, Ken Stanley
Poster
Tue 9:00 Learning Compound Tasks without Task-specific Knowledge via Imitation and Self-supervised Learning
Sang-Hyun Lee, Seung-Woo Seo
Poster
Tue 9:00 Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences
Daniel Brown, Russell Coleman, Ravi Srinivasan, Scott Niekum
Poster
Tue 9:00 A Markov Decision Process Model for Socio-Economic Systems Impacted by Climate Change
Salman Sadiq Shuvo, Yasin Yilmaz, Alan Bush, Mark Hafen
Poster
Tue 10:00 Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Zhaohan Guo, Bernardo Avila Pires, Bilal Piot, Jean-Bastien Grill, Florent Altché, Remi Munos, Mohammad Gheshlaghi Azar
Poster
Tue 10:00 Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Adam Stooke, Joshua Achiam, Pieter Abbeel
Poster
Tue 11:00 Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills
Victor Campos, Alexander Trott, Caiming Xiong, Richard Socher, Xavier Giro-i-Nieto, Jordi Torres
Poster
Tue 12:00 Ready Policy One: World Building Through Active Learning
Philip Ball, Jack Parker-Holder, Aldo Pacchiano, Krzysztof Choromanski, Stephen Roberts
Poster
Tue 12:00 Multi-step Greedy Reinforcement Learning Algorithms
Manan Tomar, Yonathan Efroni, Mohammad Ghavamzadeh
Poster
Tue 12:00 A distributional view on multi-objective policy optimization
Abbas Abdolmaleki, Sandy Huang, Leonard Hasenclever, Michael Neunert, Francis Song, Martina Zambelli, Murilo Martins, Nicolas Heess, Raia Hadsell, Martin Riedmiller
Poster
Tue 12:00 Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics
Arsenii Kuznetsov, Pavel Shvechikov, Alexander Grishin, Dmitry Vetrov
Poster
Tue 13:00 Growing Action Spaces
Gregory Farquhar, Laura Gustafson, Zeming Lin, Shimon Whiteson, Nicolas Usunier, Gabriel Synnaeve
Poster
Tue 13:00 Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning
Aleksei Petrenko, Zhehui Huang, Tushar Kumar, Gaurav Sukhatme, Vladlen Koltun
Poster
Tue 13:00 Generalization to New Actions in Reinforcement Learning
Ayush Jain, Andrew Szot, Joseph Lim
Poster
Tue 14:00 Knowing The What But Not The Where in Bayesian Optimization
Vu Nguyen, Michael A Osborne
Poster
Tue 14:00 OPtions as REsponses: Grounding behavioural hierarchies in multi-agent reinforcement learning
Sasha Vezhnevets, Yuhuai Wu, Maria Eckstein, Rémi Leblond, Joel Z Leibo
Poster
Tue 18:00 Tuning-free Plug-and-Play Proximal Algorithm for Inverse Imaging Problems
Kaixuan Wei, Angelica I Aviles-Rivero, Jingwei Liang, Ying Fu, Carola-Bibiane Schönlieb, Hua Huang
Poster
Tue 18:00 Intrinsic Reward Driven Imitation Learning via Generative Model
Xingrui Yu, Yueming Lyu, Ivor Tsang
Poster
Tue 18:00 Can Increasing Input Dimensionality Improve Deep Reinforcement Learning?
Kei Ota, Tomoaki Oiki, Devesh Jha, Toshisada Mariyama, Daniel Nikovski
Poster
Tue 18:00 Striving for Simplicity and Performance in Off-Policy DRL: Output Normalization and Non-Uniform Sampling
Che Wang, Yanqiu Wu, Quan Vuong, Keith Ross
Poster
Wed 5:00 Implicit Generative Modeling for Efficient Exploration
Neale Ratzlaff, Qinxun Bai, Fuxin Li, Wei Xu
Poster
Wed 5:00 Stabilizing Transformers for Reinforcement Learning
Emilio Parisotto, Francis Song, Jack Rae, Razvan Pascanu, Caglar Gulcehre, Siddhant Jayakumar, Max Jaderberg, Raphael Lopez Kaufman, Aidan Clark, Seb Noury, Matthew Botvinick, Nicolas Heess, Raia Hadsell
Poster
Wed 5:00 An Optimistic Perspective on Offline Deep Reinforcement Learning
Rishabh Agarwal, Dale Schuurmans, Mohammad Norouzi
Poster
Wed 5:00 Adaptive Droplet Routing in Digital Microfluidic Biochips Using Deep Reinforcement Learning
Tung-Che Liang, Zhanwei Zhong, Yaas Bigdeli, Tsung-Yi Ho, Krishnendu Chakrabarty, Richard Fair
Poster
Wed 5:00 Reinforcement Learning for Integer Programming: Learning to Cut
Yunhao Tang, Shipra Agrawal, Yuri Faenza
Poster
Wed 5:00 Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control
Jie Xu, Yunsheng Tian, Pingchuan Ma, Daniela Rus, Shinjiro Sueda, Wojciech Matusik
Poster
Wed 5:00 Learning What to Defer for Maximum Independent Sets
Sungsoo Ahn, Younggyo Seo, Jinwoo Shin
Poster
Wed 8:00 Learning to Score Behaviors for Guided Policy Optimization
Aldo Pacchiano, Jack Parker-Holder, Yunhao Tang, Krzysztof Choromanski, Anna Choromanska, Michael Jordan
Poster
Wed 8:00 Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning
Kimin Lee, Younggyo Seo, Seunghyun Lee, Honglak Lee, Jinwoo Shin
Poster
Wed 8:00 Planning to Explore via Self-Supervised World Models
Ramanan Sekar, Oleg Rybkin, Kostas Daniilidis, Pieter Abbeel, Danijar Hafner, Deepak Pathak
Poster
Wed 8:00 Cautious Adaptation For Reinforcement Learning in Safety-Critical Settings
Jesse Zhang, Brian Cheung, Chelsea Finn, Sergey Levine, Dinesh Jayaraman
Poster
Wed 9:00 Flexible and Efficient Long-Range Planning Through Curious Exploration
Aidan Curtis, Minjian Xin, Dilip Arumugam, Kevin Feigelis, Daniel Yamins
Poster
Wed 9:00 Leveraging Procedural Generation to Benchmark Reinforcement Learning
Karl Cobbe, Chris Hesse, Jacob Hilton, John Schulman
Poster
Wed 9:00 Hallucinative Topological Memory for Zero-Shot Visual Planning
Kara Liu, Thanard Kurutach, Christine Tung, Pieter Abbeel, Aviv Tamar
Poster
Wed 10:00 A Game Theoretic Framework for Model Based Reinforcement Learning
Aravind Rajeswaran, Igor Mordatch, Vikash Kumar
Poster
Wed 11:00 A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation
Pan Xu, Quanquan Gu
Poster
Wed 11:00 Revisiting Fundamentals of Experience Replay
William Fedus, Prajit Ramachandran, Rishabh Agarwal, Yoshua Bengio, Hugo Larochelle, Mark Rowland, Will Dabney
Poster
Wed 12:00 Deep Coordination Graphs
Wendelin Boehmer, Vitaly Kurin, Shimon Whiteson
Poster
Wed 12:00 CoMic: Complementary Task Learning & Mimicry for Reusable Skills
Leonard Hasenclever, Fabio Pardo, Raia Hadsell, Nicolas Heess, Josh Merel
Poster
Wed 12:00 Inductive-bias-driven Reinforcement Learning For Efficient Schedules in Heterogeneous Clusters
Subho Banerjee, Saurabh Jha, Zbigniew Kalbarczyk, Ravishankar Iyer
Poster
Wed 13:00 Fast Adaptation to New Environments via Policy-Dynamics Value Functions
Roberta Raileanu, Max Goldstein, Arthur Szlam, Rob Fergus
Poster
Wed 13:00 Interference and Generalization in Temporal Difference Learning
Emmanuel Bengio, Joelle Pineau, Doina Precup
Poster
Wed 15:00 Agent57: Outperforming the Atari Human Benchmark
Adrià Puigdomenech Badia, Bilal Piot, Steven Kapturowski, Pablo Sprechmann, Alex Vitvitskyi, Zhaohan Guo, Charles Blundell
Poster
Wed 16:00 Bidirectional Model-based Policy Optimization
Hang Lai, Jian Shen, Weinan Zhang, Yong Yu
Poster
Thu 6:00 Deep Reinforcement Learning with Smooth Policy
Qianli Shen, Yan Li, Haoming Jiang, Zhaoran Wang, Tuo Zhao
Poster
Thu 6:00 Skew-Fit: State-Covering Self-Supervised Reinforcement Learning
Vitchyr Pong, Murtaza Dalal, Steven Lin, Ashvin Nair, Shikhar Bahl, Sergey Levine
Poster
Thu 6:00 On the Expressivity of Neural Networks for Deep Reinforcement Learning
Kefan Dong, Yuping Luo, Tianhe Yu, Chelsea Finn, Tengyu Ma
Poster
Thu 6:00 Provably Efficient Model-based Policy Adaptation
Yuda Song, Aditi Mavalankar, Wen Sun, Sicun Gao
Poster
Thu 6:00 Learning Robot Skills with Temporal Variational Inference
Tanmay Shankar, Abhinav Gupta
Poster
Thu 6:00 Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning
Silviu Pitis, Harris Chan, Stephen Zhao, Bradly Stadie, Jimmy Ba
Poster
Thu 6:00 Constrained Markov Decision Processes via Backward Value Functions
Harsh Satija, Philip Amortila, Joelle Pineau
Poster
Thu 6:00 GraphOpt: Learning Optimization Models of Graph Formation
Rakshit Trivedi, Jiachen Yang, Hongyuan Zha
Poster
Thu 6:00 Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
Ashley Edwards, Himanshu Sahni, Rosanne Liu, Jane Hung, Ankit Jain, Rui Wang, Adrien Ecoffet, Thomas Miconi, Charles Isbell, Jason Yosinski
Poster
Thu 7:00 ConQUR: Mitigating Delusional Bias in Deep Q-Learning
DiJia Su, Jayden Ooi, Tyler Lu, Dale Schuurmans, Craig Boutilier
Poster
Thu 7:00 Goal-Aware Prediction: Learning to Model What Matters
Suraj Nair, Silvio Savarese, Chelsea Finn
Poster
Thu 8:00 Learning Human Objectives by Evaluating Hypothetical Behavior
Siddharth Reddy, Anca Dragan, Sergey Levine, Shane Legg, Jan Leike
Poster
Thu 8:00 One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control
Wenlong Huang, Igor Mordatch, Deepak Pathak
Poster
Thu 8:00 Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation
Shangtong Zhang, Bo Liu, Hengshuai Yao, Shimon Whiteson
Poster
Thu 8:00 An Imitation Learning Approach for Cache Replacement
Evan Liu, Milad Hashemi, Kevin Swersky, Partha Ranganathan, Junwhan Ahn
Poster
Thu 9:00 Learning to Navigate The Synthetically Accessible Chemical Space Using Reinforcement Learning
Sai Krishna Gottipati, Boris Sattarov, Sufeng Niu, Yashaswi Pathak, Haoran Wei, Shengchao Liu, Simon Blackburn, Karam Thomas, Connor Coley, Jian Tang, Sarath Chandar, Yoshua Bengio
Poster
Thu 9:00 CURL: Contrastive Unsupervised Representations for Reinforcement Learning
Michael Laskin, Aravind Srinivas, Pieter Abbeel
Poster
Thu 9:00 Inferring DQN structure for high-dimensional continuous control
Andrey Sakryukin, Chedy Raissi, Mohan Kankanhalli
Poster
Thu 9:00 ROMA: Multi-Agent Reinforcement Learning with Emergent Roles
Tonghan Wang, Heng Dong, Victor Lesser, Chongjie Zhang
Poster
Thu 9:00 Symbolic Network: Generalized Neural Policies for Relational MDPs
Sankalp Garg, Aniket Bajpai, Mausam
Poster
Thu 12:00 Predictive Coding for Locally-Linear Control
Rui Shu, Tung Nguyen, Yinlam Chow, Tuan Pham, Khoat Than, Mohammad Ghavamzadeh, Stefano Ermon, Hung Bui
Poster
Thu 12:00 Monte-Carlo Tree Search as Regularized Policy Optimization
Jean-Bastien Grill, Florent Altché, Yunhao Tang, Thomas Hubert, Michal Valko, Ioannis Antonoglou, Remi Munos
Poster
Thu 14:00 Probing Emergent Semantics in Predictive Agents via Question Answering
Abhishek Das, Federico Carnevale, Hamza Merzic, Laura Rimell, Rosalia Schneider, Josh Abramson, Alden Hung, Arun Ahuja, Stephen Clark, Greg Wayne, Felix Hill
Poster
Thu 14:00 Reinforcement Learning for Molecular Design Guided by Quantum Mechanics
Gregor Simm, Robert Pinsler, Jose Miguel Hernandez-Lobato
Poster
Thu 15:00 Off-Policy Actor-Critic with Shared Experience Replay
Simon Schmitt, Matteo Hessel, Karen Simonyan
Poster
Thu 18:00 Learning Efficient Multi-agent Communication: An Information Bottleneck Approach
Rundong Wang, Xu He, Runsheng Yu, Wei Qiu, Bo An, Zinovi Rabinovich
Workshop
Fri 9:00 Poster Session
Workshop
Fri 12:05 Spotlight Talk: Deep Reinforcement Learning amidst Lifelong Non-Stationarity
Workshop
Sat 4:00 Virtual Poster Session #1
Workshop
Sat 8:00 Virtual Poster Session #2
Workshop
Sat 9:00 Contributed Talk: Deep Reinforcement Learning amidst Lifelong Non-Stationarity
Annie Xie
Workshop
Exact (Then Approximate) Dynamic Programming for Deep Reinforcement Learning
Henrik Marklund
Workshop
Nesterov Momentum Adversarial Perturbations in the Deep Reinforcement Learning Domain
Ezgi Korkmaz
Workshop
Group Equivariant Deep Reinforcement Learning
Arnab Kumar Mondal
Workshop
Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels
Ilya Kostrikov
Workshop
(#101 / Sess. 1) Graph neural induction of value iteration
Andreea Deac
Workshop
Deep Reinforcement Learning amidst Lifelong Non-Stationarity
Workshop
Accepted Papers