178 Results
Spotlight | Tue 13:15 | Model-Free Opponent Shaping | Christopher Lu · Timon Willi · Christian Schroeder de Witt · Jakob Foerster
Spotlight | Thu 13:45 | Robust Policy Learning over Multiple Uncertainty Sets | Annie Xie · Shagun Sodhani · Chelsea Finn · Joelle Pineau · Amy Zhang
Oral | Wed 10:45 | Planning with Diffusion for Flexible Behavior Synthesis | Michael Janner · Yilun Du · Josh Tenenbaum · Sergey Levine
Spotlight | Tue 11:55 | Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning | Shentao Yang · Yihao Feng · Shujian Zhang · Mingyuan Zhou
Poster | Wed 15:30 | Planning with Diffusion for Flexible Behavior Synthesis | Michael Janner · Yilun Du · Josh Tenenbaum · Sergey Levine
Spotlight | Thu 13:35 | A Parametric Class of Approximate Gradient Updates for Policy Optimization | Ramki Gummadi · Saurabh Kumar · Junfeng Wen · Dale Schuurmans
Poster | Wed 15:30 | Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters | Vladislav Kurenkov · Sergey Kolesnikov
Spotlight | Wed 8:30 | Efficient Learning for AlphaZero via Path Consistency | Dengwei Zhao · Shikui Tu · Lei Xu
Spotlight | Thu 13:50 | Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent RL | Siyi Hu · Chuanlong Xie · Xiaodan Liang · Xiaojun Chang
Poster | Tue 15:30 | Generalized Beliefs for Cooperative AI | Darius Muglich · Luisa Zintgraf · Christian Schroeder de Witt · Shimon Whiteson · Jakob Foerster
Poster | Thu 15:00 | Temporal Difference Learning for Model Predictive Control | Nicklas Hansen · Hao Su · Xiaolong Wang
Spotlight | Thu 13:40 | How to Leverage Unlabeled Data in Offline Reinforcement Learning | Tianhe (Kevin) Yu · Aviral Kumar · Yevgen Chebotar · Karol Hausman · Chelsea Finn · Sergey Levine