Toggle Poster Visibility
Oral
Thu Jul 12 05:30 AM -- 05:50 AM (PDT) @ A1
The Mirage of Action-Dependent Baselines in Reinforcement Learning
Oral
Thu Jul 12 05:50 AM -- 06:10 AM (PDT) @ A1
Smoothed Action Value Functions for Learning Gaussian Policies
Oral
Thu Jul 12 06:10 AM -- 06:20 AM (PDT) @ A1
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
[
PDF]
Oral
Thu Jul 12 06:20 AM -- 06:30 AM (PDT) @ A1
Addressing Function Approximation Error in Actor-Critic Methods