Toggle Poster Visibility
Oral
Wed Jul 24 07:30 AM -- 07:45 AM (PDT) @ Hall C 1-3 None
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Oral
Wed Jul 24 07:45 AM -- 08:00 AM (PDT) @ Hall C 1-3 None
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Oral
Wed Jul 24 08:00 AM -- 08:15 AM (PDT) @ Hall C 1-3 None
SAPG: Split and Aggregate Policy Gradients
Oral
Wed Jul 24 08:15 AM -- 08:30 AM (PDT) @ Hall C 1-3 None
Rate-Optimal Policy Optimization for Linear Markov Decision Processes