Fri 6:00 a.m. - 5:00 p.m.
|
Please visit the workshop website for the full program
(
Program
)
>
link
|
馃敆
|
Fri 6:00 a.m. - 6:20 a.m.
|
Opening Remarks
(
Presentation
)
>
SlidesLive Video
|
馃敆
|
Fri 6:20 a.m. - 7:00 a.m.
|
Differentiable optimization for control and reinforcement learning
(
Invited Talk
)
>
SlidesLive Video
|
Brandon Amos
馃敆
|
Fri 7:00 a.m. - 7:30 a.m.
|
Break
|
馃敆
|
Fri 7:30 a.m. - 8:10 a.m.
|
Discovering RL Algorithms
(
Invited Talk
)
>
SlidesLive Video
|
Junhyuk Oh
馃敆
|
Fri 8:10 a.m. - 9:00 a.m.
|
Discovered Policy Optimisation. Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy. Adaptive Interest for Emphatic Reinforcement Learning
(
Contributed Talks
)
>
|
馃敆
|
Fri 9:00 a.m. - 10:40 a.m.
|
Break
|
馃敆
|
Fri 10:40 a.m. - 11:20 a.m.
|
The Value Equivalence Principle for Model-Based RL
(
Invited Talk
)
>
SlidesLive Video
|
Christopher Grimm
馃敆
|
Fri 11:20 a.m. - 12:00 p.m.
|
A Model-Based Reinforcement Learning Wishlist
(
Invited Talk
)
>
SlidesLive Video
|
Erin Talvitie
馃敆
|
Fri 12:00 p.m. - 12:30 p.m.
|
Break
|
馃敆
|
Fri 12:30 p.m. - 1:30 p.m.
|
DARL Panel
(
Panel Discussion
)
>
SlidesLive Video
|
馃敆
|
Fri 1:30 p.m. - 2:30 p.m.
|
Poster Session
(
In-person only poster presentation
)
>
|
馃敆
|
Fri 2:30 p.m. - 3:10 p.m.
|
Policy Gradient: Theory for Making Best Use of It
(
Invited Talk
)
>
SlidesLive Video
|
Mengdi Wang
馃敆
|
Fri 3:10 p.m. - 3:50 p.m.
|
General-purpose meta learning
(
Invited Talk
)
>
SlidesLive Video
|
Louis Kirsch
馃敆
|
Fri 3:50 p.m. - 5:00 p.m.
|
Closing Remarks & Poster Session
(
Presentation followed by an In-person only poster presentation
)
>
|
馃敆
|
-
|
Effective Offline RL Needs Going Beyond Pessimism: Representations and Distributional Shift
(
Poster
)
>
link
|
Xinyang Geng 路 Kevin Li 路 Abhishek Gupta 路 Aviral Kumar 路 Sergey Levine
馃敆
|
-
|
Hyperbolically Discounted Advantage Estimation for Generalization in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Nasik Muhammad Nafi 路 Raja Farrukh Ali 路 William Hsu
馃敆
|
-
|
Deep Policy Generators
(
Poster
)
>
link
|
Francesco Faccio 路 Vincent Herrmann 路 Aditya Ramesh 路 Louis Kirsch 路 J眉rgen Schmidhuber
馃敆
|
-
|
CoMBiNED: Multi-Constrained Model Based Planning for Navigation in Dynamic Environments
(
Poster
)
>
link
SlidesLive Video
|
Harit Pandya 路 Rudra Poudel 路 Stephan Liwicki
馃敆
|
-
|
Exploration Hurts in Bandits with Partially Observed Stochastic Contexts
(
Poster
)
>
link
|
Hongju Park 路 Mohamad Kazem Shirani Faradonbeh
馃敆
|
-
|
Exploration in Reward Machines with Low Regret
(
Poster
)
>
link
SlidesLive Video
|
Hippolyte Bourel 路 Anders Jonsson 路 Odalric-Ambrym Maillard 路 Mohammad Sadegh Talebi
馃敆
|
-
|
Exploring Long-Horizon Reasoning with Deep RL in Combinatorially Hard Tasks
(
Poster
)
>
link
|
Andrew C Li 路 Pashootan Vaezipoor 路 Rodrigo A Toro Icarte 路 Sheila McIlraith
馃敆
|
-
|
VIPer: Iterative Value-Aware Model Learning on the Value Improvement Path
(
Poster
)
>
link
SlidesLive Video
|
Romina Abachi 路 Claas Voelcker 路 Animesh Garg 路 Amir-massoud Farahmand
馃敆
|
-
|
Model-Based Meta Automatic Curriculum Learning
(
Poster
)
>
link
SlidesLive Video
|
Zifan Xu 路 Yulin Zhang 路 Shahaf Shperberg 路 Reuth Mirsky 路 Yuqian Jiang 路 Bo Liu 路 Peter Stone
馃敆
|
-
|
Adaptive Interest for Emphatic Reinforcement Learning
(
Spotlight
)
>
link
SlidesLive Video
|
Martin Klissarov 路 Rasool Fakoor 路 Jonas Mueller 路 Kavosh Asadi 路 Taesup Kim 路 Alex Smola
馃敆
|
-
|
General Policy Evaluation and Improvement by Learning to Identify Few But Crucial States
(
Poster
)
>
link
|
Francesco Faccio 路 Aditya Ramesh 路 Vincent Herrmann 路 Jean Harb 路 J眉rgen Schmidhuber
馃敆
|
-
|
An Investigation into the Open World Survival Game Crafter
(
Poster
)
>
link
SlidesLive Video
|
Aleksandar Stanic 路 Yujin Tang 路 David Ha 路 J眉rgen Schmidhuber
馃敆
|
-
|
Unsupervised Model-based Pre-training for Data-efficient Reinforcement Learning from Pixels
(
Poster
)
>
link
SlidesLive Video
|
Sai Rajeswar 路 Pietro Mazzaglia 路 Tim Verbelen 路 Alex Piche 路 Bart Dhoedt 路 Aaron Courville 路 Alexandre Lacoste
馃敆
|
-
|
Model-Based Reinforcement Learning with SINDy
(
Poster
)
>
link
SlidesLive Video
|
Rushiv Arora 路 Eliot Moss 路 Bruno da Silva
馃敆
|
-
|
Toward Human Cognition-inspired High-Level Decision Making For Hierarchical Reinforcement Learning Agents
(
Poster
)
>
link
SlidesLive Video
|
Rousslan F. J. Dossa 路 Takashi Matsubara
馃敆
|
-
|
MoCoDA: Model-based Counterfactual Data Augmentation
(
Poster
)
>
link
SlidesLive Video
|
Silviu Pitis 路 Elliot Creager 路 Ajay Mandlekar 路 Animesh Garg
馃敆
|
-
|
An Adaptive Entropy-Regularization Framework for Multi-Agent Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
WOOJUN KIM 路 Youngchul Sung
馃敆
|
-
|
Leader-based Decision Learning for Cooperative Multi-Agent Reinforcement Learning
(
Poster
)
>
link
|
Wenqi Chen 路 Xin Zeng 路 Amber Li
馃敆
|
-
|
Recursive History Representations for Unsupervised Reinforcement Learning in Multiple-Environments
(
Poster
)
>
link
|
Mirco Mutti 路 Pietro Maldini 路 Riccardo De Santi 路 Marcello Restelli
馃敆
|
-
|
Building a Subspace of Policies for Scalable Continual Learning
(
Poster
)
>
link
SlidesLive Video
|
Jean-Baptiste Gaya 路 Thang Doan 路 Lucas Caccia 路 Laure Soulier 路 Ludovic Denoyer 路 Roberta Raileanu
馃敆
|
-
|
DASCO: Dual-Generator Adversarial Support Constrained Offline Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Quan Vuong 路 Aviral Kumar 路 Sergey Levine 路 Yevgen Chebotar
馃敆
|
-
|
Representation Gap in Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Qiang He 路 Huangyuan Su 路 Jieyu Zhang 路 Xinwen Hou
馃敆
|
-
|
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
(
Poster
)
>
link
SlidesLive Video
|
Cong Lu 路 Philip Ball 路 Tim G. J Rudner 路 Jack Parker-Holder 路 Michael A Osborne 路 Yee-Whye Teh
馃敆
|
-
|
Giving Feedback on Interactive Student Programs with Meta-Exploration
(
Poster
)
>
link
SlidesLive Video
|
Evan Liu 路 Moritz Stephan 路 Allen Nie 路 Chris Piech 路 Emma Brunskill 路 Chelsea Finn
馃敆
|
-
|
When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning
(
Poster
)
>
link
|
Annie Xie 路 Fahim Tajwar 路 Archit Sharma 路 Chelsea Finn
馃敆
|
-
|
Beyond the Return: Off-policy Function Estimation under User-specified Error-measuring Distributions
(
Poster
)
>
link
SlidesLive Video
|
Audrey Huang 路 Nan Jiang
馃敆
|
-
|
Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees
(
Poster
)
>
link
SlidesLive Video
|
Siliang Zeng 路 Chenliang Li 路 Alfredo Garcia 路 Mingyi Hong
馃敆
|
-
|
You Can鈥檛 Count on Luck: Why Decision Transformers Fail in Stochastic Environments
(
Poster
)
>
link
SlidesLive Video
|
Keiran Paster 路 Sheila McIlraith 路 Jimmy Ba
馃敆
|
-
|
Convergence and Price of Anarchy Guarantees of the Softmax Policy Gradient in Markov Potential Games
(
Poster
)
>
link
SlidesLive Video
|
Dingyang Chen 路 Qi Zhang 路 Thinh Doan
馃敆
|
-
|
Fast Convergence for Unstable Reinforcement Learning Problems by Logarithmic Mapping
(
Poster
)
>
link
SlidesLive Video
|
Wang Zhang 路 Lam Nguyen 路 Subhro Das 路 Alexandre Megretsky 路 Luca Daniel 路 Tsui-Wei Weng
馃敆
|
-
|
Self-Referential Meta Learning
(
Poster
)
>
link
SlidesLive Video
|
Louis Kirsch 路 J眉rgen Schmidhuber
馃敆
|
-
|
Distributionally Adaptive Meta Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Anurag Ajay 路 Dibya Ghosh 路 Sergey Levine 路 Pulkit Agrawal 路 Abhishek Gupta
馃敆
|
-
|
You Only Live Once: Single-Life Reinforcement Learning via Learned Reward Shaping
(
Poster
)
>
link
|
Annie Chen 路 Archit Sharma 路 Sergey Levine 路 Chelsea Finn
馃敆
|
-
|
Discovered Policy Optimisation
(
Spotlight
)
>
link
SlidesLive Video
|
Christopher Lu 路 Jakub Grudzien Kuba 路 Alistair Letcher 路 Luke Metz 路 Christian Schroeder 路 Jakob Foerster
馃敆
|
-
|
Directed Exploration via Uncertainty-Aware Critics
(
Poster
)
>
link
|
Amarildo Likmeta 路 Matteo Sacco 路 Alberto Maria Metelli 路 Marcello Restelli
馃敆
|
-
|
Adversarial Cheap Talk
(
Poster
)
>
link
|
Christopher Lu 路 Timon Willi 路 Alistair Letcher 路 Jakob Foerster
馃敆
|
-
|
Adaptive Intrinsic Motivation with Decision Awareness
(
Poster
)
>
link
SlidesLive Video
|
Suyoung Lee 路 Sae-Young Chung
馃敆
|
-
|
Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare
(
Poster
)
>
link
SlidesLive Video
|
Shengpu Tang 路 Maggie Makar 路 Michael Sjoding 路 Finale Doshi-Velez 路 Jenna Wiens
馃敆
|
-
|
Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting
(
Poster
)
>
link
|
Nicolai Dorka 路 Tim Welschehold 路 Wolfram Burgard
馃敆
|
-
|
Task Factorization in Curriculum Learning
(
Poster
)
>
link
SlidesLive Video
|
Reuth Mirsky 路 Shahaf Shperberg 路 Yulin Zhang 路 Zifan Xu 路 Yuqian Jiang 路 Jiaxun Cui 路 Peter Stone
馃敆
|
-
|
SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition
(
Poster
)
>
link
SlidesLive Video
|
Dylan Slack 路 Yinlam Chow 路 Bo Dai 路 Nevan Wichers
馃敆
|
-
|
Guided Exploration in Reinforcement Learning via Monte Carlo Critic Optimization
(
Poster
)
>
link
SlidesLive Video
|
Igor Kuznetsov
馃敆
|
-
|
Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy
(
Spotlight
)
>
link
SlidesLive Video
|
xiyao wang 路 Wichayaporn Wongkamjan 路 Furong Huang
馃敆
|
-
|
Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning
(
Poster
)
>
link
|
Dilip Arumugam 路 Benjamin Van Roy
馃敆
|
-
|
Generalization of Reinforcement Learning with Policy-Aware Adversarial Data Augmentation
(
Poster
)
>
link
SlidesLive Video
|
Hanping Zhang 路 Yuhong Guo
馃敆
|
-
|
MEPG: A Minimalist Ensemble Policy Gradient Framework for Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Qiang He 路 Huangyuan Su 路 Chen GONG 路 Xinwen Hou
馃敆
|