(12 events)
Oral
Thu Jul 21 10:30 AM -- 10:50 AM (PDT) @ Room 327 - 329
Generalised Policy Improvement with Geometric Policy Composition
Shantanu Thakoor · Mark Rowland · Diana Borsa · Will Dabney · Remi Munos · Andre Barreto
Spotlight
Thu Jul 21 10:50 AM -- 10:55 AM (PDT) @ Room 327 - 329
Offline Meta-Reinforcement Learning with Online Self-Supervision
Vitchyr Pong · Ashvin Nair · Laura Smith · Catherine Huang · Sergey Levine
Spotlight
Thu Jul 21 10:55 AM -- 11:00 AM (PDT) @ Room 327 - 329
Divergence-Regularized Multi-Agent Actor-Critic
Kefan Su · Zongqing Lu
Spotlight
Thu Jul 21 11:00 AM -- 11:05 AM (PDT) @ Room 327 - 329
Understanding Policy Gradient Algorithms: A Sensitivity-Based Approach
Shuang Wu · Ling Shi · Jun Wang · Guangjian Tian
Spotlight
Thu Jul 21 11:05 AM -- 11:10 AM (PDT) @ Room 327 - 329
Off-Policy Reinforcement Learning with Delayed Rewards
Beining Han · Zhizhou Ren · Zuofan Wu · Yuan Zhou · Jian Peng
Spotlight
Thu Jul 21 11:10 AM -- 11:15 AM (PDT) @ Room 327 - 329
Direct Behavior Specification via Constrained Reinforcement Learning
Julien Roy · Roger Girgis · Joshua Romoff · Pierre-Luc Bacon · Christopher Pal
Oral
Thu Jul 21 11:15 AM -- 11:35 AM (PDT) @ Room 327 - 329
Large Batch Experience Replay
Thibault Lahire · Matthieu Geist · Emmanuel Rachelson
Spotlight
Thu Jul 21 11:35 AM -- 11:40 AM (PDT) @ Room 327 - 329
Evolving Curricula with Regret-Based Environment Design
Jack Parker-Holder · Minqi Jiang · Michael Dennis · Mikayel Samvelyan · Jakob Foerster · Edward Grefenstette · Tim Rocktäschel
Spotlight
Thu Jul 21 11:40 AM -- 11:45 AM (PDT) @ Room 327 - 329
Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum
Junlin Wu · Yevgeniy Vorobeychik
Spotlight
Thu Jul 21 11:45 AM -- 11:50 AM (PDT) @ Room 327 - 329
Transformers are Meta-Reinforcement Learners
Luckeciano Melo
Spotlight
Thu Jul 21 11:50 AM -- 11:55 AM (PDT) @ Room 327 - 329
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Litian Liang · Yaosheng Xu · Stephen McAleer · Dailin Hu · Alexander Ihler · Pieter Abbeel · Roy Fox
Spotlight
Thu Jul 21 11:55 AM -- 12:00 PM (PDT) @ Room 327 - 329
Constrained Variational Policy Optimization for Safe Reinforcement Learning
Zuxin Liu · Zhepeng Cen · Vladislav Isenbaev · Wei Liu · Steven Wu · Bo Li · Ding Zhao