(12 events)
Oral
Fri Jul 22 02:30 AM -- 02:50 AM (KST) @ Room 327 - 329
Generalised Policy Improvement with Geometric Policy Composition
Shantanu Thakoor · Mark Rowland · Diana Borsa · Will Dabney · Remi Munos · Andre Barreto
Spotlight
Fri Jul 22 02:50 AM -- 02:55 AM (KST) @ Room 327 - 329
Offline Meta-Reinforcement Learning with Online Self-Supervision
Vitchyr Pong · Ashvin Nair · Laura Smith · Catherine Huang · Sergey Levine
Spotlight
Fri Jul 22 02:55 AM -- 03:00 AM (KST) @ Room 327 - 329
Divergence-Regularized Multi-Agent Actor-Critic
Kefan Su · Zongqing Lu
Spotlight
Fri Jul 22 03:00 AM -- 03:05 AM (KST) @ Room 327 - 329
Understanding Policy Gradient Algorithms: A Sensitivity-Based Approach
Shuang Wu · Ling Shi · Jun Wang · Guangjian Tian
Spotlight
Fri Jul 22 03:05 AM -- 03:10 AM (KST) @ Room 327 - 329
Off-Policy Reinforcement Learning with Delayed Rewards
Beining Han · Zhizhou Ren · Zuofan Wu · Yuan Zhou · Jian Peng
Spotlight
Fri Jul 22 03:10 AM -- 03:15 AM (KST) @ Room 327 - 329
Direct Behavior Specification via Constrained Reinforcement Learning
Julien Roy · Roger Girgis · Joshua Romoff · Pierre-Luc Bacon · Christopher Pal
Oral
Fri Jul 22 03:15 AM -- 03:35 AM (KST) @ Room 327 - 329
Large Batch Experience Replay
Thibault Lahire · Matthieu Geist · Emmanuel Rachelson
Spotlight
Fri Jul 22 03:35 AM -- 03:40 AM (KST) @ Room 327 - 329
Evolving Curricula with Regret-Based Environment Design
Jack Parker-Holder · Minqi Jiang · Michael Dennis · Mikayel Samvelyan · Jakob Foerster · Edward Grefenstette · Tim Rocktäschel
Spotlight
Fri Jul 22 03:40 AM -- 03:45 AM (KST) @ Room 327 - 329
Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum
Junlin Wu · Yevgeniy Vorobeychik
Spotlight
Fri Jul 22 03:45 AM -- 03:50 AM (KST) @ Room 327 - 329
Transformers are Meta-Reinforcement Learners
Luckeciano Melo
Spotlight
Fri Jul 22 03:50 AM -- 03:55 AM (KST) @ Room 327 - 329
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Litian Liang · Yaosheng Xu · Stephen McAleer · Dailin Hu · Alexander Ihler · Pieter Abbeel · Roy Fox
Spotlight
Fri Jul 22 03:55 AM -- 04:00 AM (KST) @ Room 327 - 329
Constrained Variational Policy Optimization for Safe Reinforcement Learning
Zuxin Liu · Zhepeng Cen · Vladislav Isenbaev · Wei Liu · Steven Wu · Bo Li · Ding Zhao