Oral
Thu Jul 21 10:30 AM -- 10:50 AM (PDT) @ Room 301 - 303
Learning Bellman Complete Representations for Offline Policy Evaluation
Spotlight
Thu Jul 21 10:50 AM -- 10:55 AM (PDT) @ Room 301 - 303
Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning
Spotlight
Thu Jul 21 10:55 AM -- 11:00 AM (PDT) @ Room 301 - 303
A Simple Reward-free Approach to Constrained Reinforcement Learning
Spotlight
Thu Jul 21 11:00 AM -- 11:05 AM (PDT) @ Room 301 - 303
Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching
Spotlight
Thu Jul 21 11:05 AM -- 11:10 AM (PDT) @ Room 301 - 303
Temporal Difference Learning for Model Predictive Control
Spotlight
Thu Jul 21 11:10 AM -- 11:15 AM (PDT) @ Room 301 - 303
Model Selection in Batch Policy Optimization
Oral
Thu Jul 21 11:15 AM -- 11:35 AM (PDT) @ Room 301 - 303
Adversarially Trained Actor Critic for Offline Reinforcement Learning
Spotlight
Thu Jul 21 11:35 AM -- 11:40 AM (PDT) @ Room 301 - 303
Optimal Estimation of Policy Gradient via Double Fitted Iteration
Spotlight
Thu Jul 21 11:40 AM -- 11:45 AM (PDT) @ Room 301 - 303
Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes
Spotlight
Thu Jul 21 11:45 AM -- 11:50 AM (PDT) @ Room 301 - 303
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Spotlight
Thu Jul 21 11:50 AM -- 11:55 AM (PDT) @ Room 301 - 303
Lagrangian Method for Q-Function Learning (with Applications to Machine Translation)
Spotlight
Thu Jul 21 11:55 AM -- 12:00 PM (PDT) @ Room 301 - 303
On the Role of Discount Factor in Offline Reinforcement Learning