Toggle Poster Visibility
Oral
Thu Jul 21 10:30 AM -- 10:50 AM (PDT) @ Room 327 - 329
Generalised Policy Improvement with Geometric Policy Composition
Spotlight
Thu Jul 21 10:50 AM -- 10:55 AM (PDT) @ Room 327 - 329
Offline Meta-Reinforcement Learning with Online Self-Supervision
Spotlight
Thu Jul 21 10:55 AM -- 11:00 AM (PDT) @ Room 327 - 329
Divergence-Regularized Multi-Agent Actor-Critic
[
Paper PDF]
Spotlight
Thu Jul 21 11:00 AM -- 11:05 AM (PDT) @ Room 327 - 329
Understanding Policy Gradient Algorithms: A Sensitivity-Based Approach
Spotlight
Thu Jul 21 11:05 AM -- 11:10 AM (PDT) @ Room 327 - 329
Off-Policy Reinforcement Learning with Delayed Rewards
Spotlight
Thu Jul 21 11:10 AM -- 11:15 AM (PDT) @ Room 327 - 329
Direct Behavior Specification via Constrained Reinforcement Learning
Oral
Thu Jul 21 11:15 AM -- 11:35 AM (PDT) @ Room 327 - 329
Large Batch Experience Replay
Spotlight
Thu Jul 21 11:35 AM -- 11:40 AM (PDT) @ Room 327 - 329
Evolving Curricula with Regret-Based Environment Design
Spotlight
Thu Jul 21 11:40 AM -- 11:45 AM (PDT) @ Room 327 - 329
Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum
Spotlight
Thu Jul 21 11:45 AM -- 11:50 AM (PDT) @ Room 327 - 329
Transformers are Meta-Reinforcement Learners
Spotlight
Thu Jul 21 11:50 AM -- 11:55 AM (PDT) @ Room 327 - 329
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
[
Paper PDF]
Spotlight
Thu Jul 21 11:55 AM -- 12:00 PM (PDT) @ Room 327 - 329
Constrained Variational Policy Optimization for Safe Reinforcement Learning