Workshop
ICML 2021 Workshop on Unsupervised Reinforcement Learning
Feryal Behbahani · Joelle Pineau · Lerrel Pinto · Roberta Raileanu · Aravind Srinivas · Denis Yarats · Amy Zhang
Fri 23 Jul, 5:45 a.m. PDT
Unsupervised learning has begun to deliver on its promise in the recent past with tremendous progress made in the fields of natural language processing and computer vision whereby large scale unsupervised pre-training has enabled fine-tuning to downstream supervised learning tasks with limited labeled data. This is particularly encouraging and appealing in the context of reinforcement learning considering that it is expensive to perform rollouts in the real world with annotations either in the form of reward signals or human demonstrations. We therefore believe that a workshop in the intersection of unsupervised and reinforcement learning is timely and we hope to bring together researchers with diverse views on how to make further progress in this exciting and open-ended subfield.
Schedule
Fri 5:45 a.m. - 6:00 a.m.
|
Opening remarks
(
Opening remarks
)
>
|
🔗 |
Fri 6:00 a.m. - 6:30 a.m.
|
Invited Talk by David Ha
(
Invited talk
)
>
|
David Ha 🔗 |
Fri 6:30 a.m. - 7:00 a.m.
|
Invited Talk by Alessandro Lazaric
(
Invited talk
)
>
SlidesLive Video |
Alessandro Lazaric 🔗 |
Fri 7:00 a.m. - 7:30 a.m.
|
Invited Talk by Kelsey Allen
(
Invited talk
)
>
SlidesLive Video |
Kelsey Allen 🔗 |
Fri 7:30 a.m. - 8:30 a.m.
|
Coffee break and Poster Session
(
Poster Session
)
>
|
🔗 |
Fri 8:30 a.m. - 9:00 a.m.
|
Invited Talk by Danijar Hafner
(
Invited talk
)
>
SlidesLive Video |
Danijar Hafner 🔗 |
Fri 9:00 a.m. - 9:30 a.m.
|
Invited Talk by Nan Rosemary Ke
(
Invited talk
)
>
SlidesLive Video |
Rosemary Nan Ke 🔗 |
Fri 9:30 a.m. - 10:30 a.m.
|
Lunch and Poster Session
(
Poster session
)
>
|
🔗 |
Fri 10:30 a.m. - 10:50 a.m.
|
Oral Presentation: Discovering and Achieving Goals with World Models
(
Oral presentation
)
>
SlidesLive Video |
Oleh Rybkin · Deepak Pathak 🔗 |
Fri 10:50 a.m. - 11:10 a.m.
|
Oral Presentation: Planning from Pixels in Environments with Combinatorially Hard Search Spaces
(
Oral Presentation
)
>
SlidesLive Video |
Georg Martius · Marco Bagatella 🔗 |
Fri 11:10 a.m. - 11:30 a.m.
|
Oral Presentation: Learning Task Agnostic Skills with Data-driven Guidance
(
Oral Presentation
)
>
SlidesLive Video |
Even Klemsdal · Abdulmajid Murad 🔗 |
Fri 11:30 a.m. - 12:00 p.m.
|
Invited Talk by Kianté Brantley
(
Invited talk
)
>
SlidesLive Video |
Kiante Brantley 🔗 |
Fri 12:00 p.m. - 1:00 p.m.
|
Coffee break and Poster Session
(
Poster session
)
>
|
🔗 |
Fri 1:00 p.m. - 1:30 p.m.
|
Invited Talk by Chelsea Finn
(
Invited talk
)
>
SlidesLive Video |
Chelsea Finn 🔗 |
Fri 1:30 p.m. - 2:00 p.m.
|
Invited Talk by Pieter Abbeel
(
Invited talk
)
>
SlidesLive Video |
Pieter Abbeel 🔗 |
Fri 2:00 p.m. - 2:30 p.m.
|
Panel Discussion
(
Panel Discussion
)
>
SlidesLive Video |
Rosemary Nan Ke · Danijar Hafner · Pieter Abbeel · Chelsea Finn · Chelsea Finn 🔗 |
-
|
Reward-Free Policy Space Compression for Reinforcement Learning
(
Poster
)
>
|
Mirco Mutti · Stefano Del Col · Marcello Restelli 🔗 |
-
|
Learning to Explore Multiple Environments without Rewards
(
Poster
)
>
|
Mirco Mutti · Mattia Mancassola · Marcello Restelli 🔗 |
-
|
The Importance of Non-Markovianity in Maximum State Entropy Exploration
(
Poster
)
>
|
Mirco Mutti · Riccardo De Santi · Marcello Restelli 🔗 |
-
|
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching
(
Poster
)
>
|
Pierre-Alexandre Kamienny · Jean Tarbouriech · Alessandro Lazaric · Ludovic Denoyer 🔗 |
-
|
Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation
(
Poster
)
>
|
Evgenii Nikishin · Romina Abachi · Rishabh Agarwal · Pierre-Luc Bacon 🔗 |
-
|
Planning from Pixels in Environments with Combinatorially Hard Search Spaces
(
Poster
)
>
|
Marco Bagatella · Miroslav Olšák · Michal Rolinek · Georg Martius 🔗 |
-
|
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning
(
Poster
)
>
|
Víctor Campos · Pablo Sprechmann · Steven Hansen · Andre Barreto · Steven Kapturowski · Alex Vitvitskyi · Adrià Puigdomenech Badia · Charles Blundell 🔗 |
-
|
Pretrained Encoders are All You Need
(
Poster
)
>
|
Mina Khan · Advait Rane · Srivatsa P · Shriram Chenniappa · Rishabh Anand · Sherjil Ozair · Patricia Maes 🔗 |
-
|
Density-Based Bonuses on Learned Representations for Reward-Free Exploration in Deep Reinforcement Learning
(
Poster
)
>
|
Omar Darwiche Domingues · Corentin Tallec · Remi Munos · Michal Valko 🔗 |
-
|
Learning Task Agnostic Skills with Data-driven Guidance
(
Poster
)
>
|
Even Klemsdal · Sverre Herland · Abdulmajid Murad 🔗 |
-
|
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation
(
Poster
)
>
|
Nicklas Hansen · Hao Su · Xiaolong Wang 🔗 |
-
|
Disentangled Predictive Representation for Meta-Reinforcement Learning
(
Poster
)
>
|
Sephora Madjiheurem · Laura Toni 🔗 |
-
|
Decoupling Exploration and Exploitation in Reinforcement Learning
(
Poster
)
>
|
Lukas Schäfer · Filippos Christianos · Josiah Hanna · Stefano V. Albrecht 🔗 |
-
|
Tangent Space Least Adaptive Clustering
(
Poster
)
>
|
James Buenfil · Samson Koelle · Marina Meila 🔗 |
-
|
Representation Learning for Out-of-distribution Generalization in Downstream Tasks
(
Poster
)
>
|
Frederik Träuble · Andrea Dittadi · Manuel Wuthrich · Felix Widmaier · Peter V Gehler · Ole Winther · Francesco Locatello · Olivier Bachem · Bernhard Schölkopf · Stefan Bauer 🔗 |
-
|
SparseDice: Imitation Learning for Temporally Sparse Data via Regularization
(
Poster
)
>
|
Alberto Camacho · Izzeddin Gur · Marcin Moczulski · Ofir Nachum · Aleksandra Faust 🔗 |
-
|
CoBERL: Contrastive BERT for Reinforcement Learning
(
Poster
)
>
|
Andrea Banino · Adrià Puigdomenech Badia · Jacob C Walker · Tim Scholtes · Jovana Mitrovic · Charles Blundell 🔗 |
-
|
Visual Adversarial Imitation Learning using Variational Models
(
Poster
)
>
|
Rafael Rafailov · Tianhe (Kevin) Yu · Aravind Rajeswaran · Chelsea Finn 🔗 |
-
|
Discovering and Achieving Goals with World Models
(
Poster
)
>
|
Russell Mendonca · Oleh Rybkin · Kostas Daniilidis · Danijar Hafner · Deepak Pathak 🔗 |
-
|
Decision Transformer: Reinforcement Learning via Sequence Modeling
(
Poster
)
>
|
Lili Chen · Kevin Lu · Aravind Rajeswaran · Kimin Lee · Aditya Grover · Michael Laskin · Pieter Abbeel · Aravind Srinivas · Igor Mordatch 🔗 |
-
|
Episodic Memory for Subjective-Timescale Models
(
Poster
)
>
|
Alexey Zakharov · Matthew Crosby · Zafeirios Fountas 🔗 |
-
|
Reinforcement Learning as One Big Sequence Modeling Problem
(
Poster
)
>
|
Michael Janner · Qiyang Li · Sergey Levine 🔗 |
-
|
Exploration-Driven Representation Learning in Reinforcement Learning
(
Poster
)
>
|
Akram Erraqabi · Mingde Zhao · Marlos C. Machado · Yoshua Bengio · Sainbayar Sukhbaatar · Ludovic Denoyer · Alessandro Lazaric 🔗 |
-
|
Learning to Represent State with Perceptual Schemata
(
Poster
)
>
|
Wilka T Carvalho · Murray Shanahan 🔗 |
-
|
Data-Efficient Exploration with Self Play for Atari
(
Poster
)
>
|
Michael Laskin · Catherine Cang · Ryan Rudes · Pieter Abbeel 🔗 |
-
|
Intrinsic Control of Variational Beliefs in Dynamic Partially-Observed Visual Environments
(
Poster
)
>
|
Nicholas Rhinehart · Jenny Wang · Glen Berseth · John Co-Reyes · Danijar Hafner · Chelsea Finn · Sergey Levine 🔗 |
-
|
Discovering Diverse Nearly Optimal Policies with Successor Features
(
Poster
)
>
|
Tom Zahavy · Brendan O'Donoghue · Andre Barreto · Sebastian Flennerhag · Vlad Mnih · Satinder Singh 🔗 |
-
|
Reward is enough for convex MDPs
(
Poster
)
>
|
Tom Zahavy · Brendan O'Donoghue · Guillaume Desjardins · Satinder Singh 🔗 |
-
|
Unsupervised Skill-Discovery and Skill-Learning in Minecraft
(
Poster
)
>
|
Juan José Nieto · Roger Creus Castanyer · Xavier Giro-i-Nieto 🔗 |
-
|
Learning Task-Relevant Representations with Selective Contrast for Reinforcement Learning in a Real-World Application
(
Poster
)
>
|
Flemming Brieger · Daniel A Braun · Sascha Lange 🔗 |
-
|
Exploration via Empowerment Gain: Combining Novelty, Surprise and Learning Progress
(
Poster
)
>
|
Philip Becker-Ehmck · Maximilian Karl · Jan Peters · Patrick van der Smagt 🔗 |
-
|
MASAI: Multi-agent Summative Assessment Improvement for Unsupervised Environment Design
(
Poster
)
>
|
Yiping Wang · Brandon Haworth 🔗 |
-
|
Explore and Control with Adversarial Surprise
(
Poster
)
>
|
Arnaud Fickinger · Natasha Jaques · Samyak Parajuli · Michael Chang · Nicholas Rhinehart · Glen Berseth · Stuart Russell · Sergey Levine 🔗 |
-
|
When Does Overconservatism Hurt Offline Learning?
(
Poster
)
>
|
Karush Suri · Florian Shkurti 🔗 |
-
|
Hierarchical Few-Shot Imitation with Skill Transition Models
(
Poster
)
>
|
kourosh hakhamaneshi · Ruihan Zhao · Albert Zhan · Pieter Abbeel · Michael Laskin 🔗 |
-
|
Exploration and preference satisfaction trade-off in reward-free learning
(
Poster
)
>
|
Noor Sajid · Panagiotis Tigas · Alexey Zakharov · Zafeirios Fountas · Karl Friston 🔗 |
-
|
Visualizing MuZero Models
(
Poster
)
>
|
joery de Vries · Ken Voskuil · Thomas M Moerland · Aske Plaat 🔗 |
-
|
Did I do that? Blame as a means to identify controlled effects in reinforcement learning
(
Poster
)
>
|
Oriol Corcoll · Raul Vicente 🔗 |