Oral
|
Tue 5:00 |
Phasic Policy Gradient Karl Cobbe · Jacob Hilton · Oleg Klimov · John Schulman |
|
Spotlight
|
Tue 5:20 |
Reinforcement Learning with Prototypical Representations Denis Yarats · Rob Fergus · Alessandro Lazaric · Lerrel Pinto |
|
Spotlight
|
Tue 5:25 |
Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration Seungyul Han · Youngchul Sung |
|
Spotlight
|
Tue 5:30 |
Muesli: Combining Improvements in Policy Optimization Matteo Hessel · Ivo Danihelka · Fabio Viola · Arthur Guez · Simon Schmitt · Laurent Sifre · Theophane Weber · David Silver · Hado van Hasselt |
|
Spotlight
|
Tue 5:35 |
Unsupervised Learning of Visual 3D Keypoints for Control Boyuan Chen · Pieter Abbeel · Deepak Pathak |
|
Spotlight
|
Tue 5:40 |
Learning Task Informed Abstractions Xiang Fu · Ge Yang · Pulkit Agrawal · Tommi Jaakkola |
|
Spotlight
|
Tue 5:45 |
State Entropy Maximization with Random Encoders for Efficient Exploration Younggyo Seo · Lili Chen · Jinwoo Shin · Honglak Lee · Pieter Abbeel · Kimin Lee |
|
Spotlight
|
Tue 6:40 |
Bayesian Deep Learning via Subnetwork Inference Erik Daxberger · Eric Nalisnick · James Allingham · Javier AntorĂ¡n · Jose Miguel Hernandez-Lobato |
|
Oral
|
Tue 7:00 |
Skill Discovery for Exploration and Planning using Deep Skill Graphs Akhil Bagaria · Jason Senthil · George Konidaris |
|
Oral
|
Tue 7:00 |
World Model as a Graph: Learning Latent Landmarks for Planning Lunjun Zhang · Ge Yang · Bradly Stadie |
|
Spotlight
|
Tue 7:20 |
Revisiting Rainbow: Promoting more insightful and inclusive deep reinforcement learning research Johan Obando Ceron · Pablo Samuel Castro |
|
Spotlight
|
Tue 7:20 |
Learning Routines for Effective Off-Policy Reinforcement Learning Edoardo Cetin · Oya Celiktutan |