Toggle Poster Visibility
Oral
Fri Jul 22 02:30 AM -- 02:50 AM (KST) @ Room 301 - 303
Learning Bellman Complete Representations for Offline Policy Evaluation
Spotlight
Fri Jul 22 02:50 AM -- 02:55 AM (KST) @ Room 301 - 303
Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning
Spotlight
Fri Jul 22 02:55 AM -- 03:00 AM (KST) @ Room 301 - 303
A Simple Reward-free Approach to Constrained Reinforcement Learning
[
Paper PDF]
Spotlight
Fri Jul 22 03:00 AM -- 03:05 AM (KST) @ Room 301 - 303
Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching
Spotlight
Fri Jul 22 03:05 AM -- 03:10 AM (KST) @ Room 301 - 303
Temporal Difference Learning for Model Predictive Control
Spotlight
Fri Jul 22 03:10 AM -- 03:15 AM (KST) @ Room 301 - 303
Model Selection in Batch Policy Optimization
Oral
Fri Jul 22 03:15 AM -- 03:35 AM (KST) @ Room 301 - 303 None
Adversarially Trained Actor Critic for Offline Reinforcement Learning
Spotlight
Fri Jul 22 03:35 AM -- 03:40 AM (KST) @ Room 301 - 303
Optimal Estimation of Policy Gradient via Double Fitted Iteration
Spotlight
Fri Jul 22 03:40 AM -- 03:45 AM (KST) @ Room 301 - 303
Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes
Spotlight
Fri Jul 22 03:45 AM -- 03:50 AM (KST) @ Room 301 - 303
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Spotlight
Fri Jul 22 03:50 AM -- 03:55 AM (KST) @ Room 301 - 303
Lagrangian Method for Q-Function Learning (with Applications to Machine Translation)
[
Paper PDF]
Spotlight
Fri Jul 22 03:55 AM -- 04:00 AM (KST) @ Room 301 - 303
On the Role of Discount Factor in Offline Reinforcement Learning
Successful Page Load