Skip to yearly menu bar Skip to main content


(15 events)   Timezone:  
Show all
Toggle Poster Visibility
Spotlight
Wed Jul 20 07:30 AM -- 07:35 AM (PDT) @ Room 307
Modeling Strong and Human-Like Gameplay with KL-Regularized Search
Athul Paul Jacob · David Wu · Gabriele Farina · Adam Lerer · Hengyuan Hu · Anton Bakhtin · Jacob Andreas · Noam Brown
Spotlight
Wed Jul 20 07:35 AM -- 07:40 AM (PDT) @ Room 307
Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters
Vladislav Kurenkov · Sergey Kolesnikov
Spotlight
Wed Jul 20 07:40 AM -- 07:45 AM (PDT) @ Room 307
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li · Tian Gao · Jiaqi Yang · Huazhe Xu · Yi Wu
Spotlight
Wed Jul 20 07:45 AM -- 07:50 AM (PDT) @ Room 307
Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models and Amortized Policy Search
Qi Wang · Herke van Hoof
Spotlight
Wed Jul 20 07:50 AM -- 07:55 AM (PDT) @ Room 307
Generalized Data Distribution Iteration
Jiajun Fan · Changnan Xiao
Spotlight
Wed Jul 20 07:55 AM -- 08:00 AM (PDT) @ Room 307
Optimizing Tensor Network Contraction Using Reinforcement Learning
Eli Meirom · Haggai Maron · Shie Mannor · Gal Chechik
Spotlight
Wed Jul 20 08:00 AM -- 08:05 AM (PDT) @ Room 307
History Compression via Language Models in Reinforcement Learning
Fabian Paischer · Thomas Adler · Vihang Patil · Angela Bitto-Nemling · Markus Holzleitner · Sebastian Lehner · Hamid Eghbal-zadeh · Sepp Hochreiter
Oral
Wed Jul 20 08:05 AM -- 08:25 AM (PDT) @ Room 307
REvolveR: Continuous Evolutionary Models for Robot-to-robot Policy Transfer
Xingyu Liu · Deepak Pathak · Kris Kitani
Spotlight
Wed Jul 20 08:25 AM -- 08:30 AM (PDT) @ Room 307
LeNSE: Learning To Navigate Subgraph Embeddings for Large-Scale Combinatorial Optimisation
David Ireland · Giovanni Montana
Spotlight
Wed Jul 20 08:30 AM -- 08:35 AM (PDT) @ Room 307
Efficient Learning for AlphaZero via Path Consistency
Dengwei Zhao · Shikui Tu · Lei Xu
Spotlight
Wed Jul 20 08:35 AM -- 08:40 AM (PDT) @ Room 307
A data-driven approach for learning to control computers
Peter Humphreys · David Raposo · Tobias Pohlen · Gregory Thornton · Rachita Chhaparia · Alistair Muldal · Josh Abramson · Petko Georgiev · Adam Santoro · Timothy Lillicrap
Spotlight
Wed Jul 20 08:40 AM -- 08:45 AM (PDT) @ Room 307
Zero-Shot Reward Specification via Grounded Natural Language
Parsa Mahmoudieh · Deepak Pathak · Trevor Darrell
Spotlight
Wed Jul 20 08:45 AM -- 08:50 AM (PDT) @ Room 307
How to Stay Curious while avoiding Noisy TVs using Aleatoric Uncertainty Estimation
Augustine Mavor-Parker · Kimberly Young · Caswell Barry · Lewis Griffin
Spotlight
Wed Jul 20 08:50 AM -- 08:55 AM (PDT) @ Room 307
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos · Eszter VĂ©rtes · Zita Marinho · Gregory Farquhar · Diana Borsa · Abe Friesen · Feryal Behbahani · Tom Schaul · Andre Barreto · Simon Osindero
Spotlight
Wed Jul 20 08:55 AM -- 09:00 AM (PDT) @ Room 307
Improving Policy Optimization with Generalist-Specialist Learning
Zhiwei Jia · Xuanlin Li · Zhan Ling · Shuang Liu · Yiran Wu · Hao Su