Oral
Thu Jul 22 06:00 AM -- 06:20 AM (PDT)
Temporal Difference Learning as Gradient Splitting
[ Paper ]
Spotlight
Thu Jul 22 06:20 AM -- 06:25 AM (PDT)
First-Order Methods for Wasserstein Distributionally Robust MDP
[ Paper ]
Spotlight
Thu Jul 22 06:30 AM -- 06:35 AM (PDT)
Adaptive Sampling for Best Policy Identification in Markov Decision Processes
[ Paper ]
Spotlight
Thu Jul 22 06:35 AM -- 06:40 AM (PDT)
Quantum algorithms for reinforcement learning with a generative model
[ Paper ]
Spotlight
Thu Jul 22 06:40 AM -- 06:45 AM (PDT)
Posterior Value Functions: Hindsight Baselines for Policy Gradient Methods
[ Paper ]
Spotlight
Thu Jul 22 06:45 AM -- 06:50 AM (PDT)
Learning Interaction Kernels for Agent Systems on Riemannian Manifolds
[ Paper ]