firstbacksecondback
Filter by Keyword:
119 Results
Spotlight
|
Wed 5:45 |
Accelerating Safe Reinforcement Learning with Constraint-mismatched Baseline Policies Jimmy Yang · Justinian Rosca · Karthik Narasimhan · Peter Ramadge |
|
Poster
|
Tue 21:00 |
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training Kimin Lee · Laura Smith · Pieter Abbeel |
|
Poster
|
Tue 9:00 |
Model-Based Reinforcement Learning via Latent-Space Collocation Oleh Rybkin · Chuning Zhu · Anusha Nagabandi · Kostas Daniilidis · Igor Mordatch · Sergey Levine |
|
Poster
|
Tue 9:00 |
State Entropy Maximization with Random Encoders for Efficient Exploration Younggyo Seo · Lili Chen · Jinwoo Shin · Honglak Lee · Pieter Abbeel · Kimin Lee |
|
Oral
|
Tue 18:00 |
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training Kimin Lee · Laura Smith · Pieter Abbeel |
|
Spotlight
|
Tue 7:30 |
Offline Reinforcement Learning with Pseudometric Learning Robert Dadashi · Shideh Rezaeifar · Nino Vieillard · Léonard Hussenot · Olivier Pietquin · Matthieu Geist |
|
Spotlight
|
Tue 19:25 |
Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning Hiroki Furuta · Tatsuya Matsushima · Tadashi Kozuno · Yutaka Matsuo · Sergey Levine · Ofir Nachum · Shixiang Gu |
|
Spotlight
|
Tue 19:35 |
MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven Reinforcement Learning Kevin Li · Abhishek Gupta · Ashwin D Reddy · Vitchyr Pong · Aurick Zhou · Justin Yu · Sergey Levine |
|
Spotlight
|
Tue 18:45 |
Reinforcement Learning of Implicit and Explicit Control Flow Instructions Ethan Brooks · Janarthanan Rajendran · Richard Lewis · Satinder Singh |
|
Poster
|
Wed 9:00 |
Accelerating Safe Reinforcement Learning with Constraint-mismatched Baseline Policies Jimmy Yang · Justinian Rosca · Karthik Narasimhan · Peter Ramadge |
|
Poster
|
Wed 9:00 |
Spectral Normalisation for Deep Reinforcement Learning: An Optimisation Perspective Florin Gogianu · Tudor Berariu · Mihaela Rosca · Claudia Clopath · Lucian Busoniu · Razvan Pascanu |
|
Poster
|
Tue 9:00 |
Offline Reinforcement Learning with Pseudometric Learning Robert Dadashi · Shideh Rezaeifar · Nino Vieillard · Léonard Hussenot · Olivier Pietquin · Matthieu Geist |