Oral
Thu Jul 12 09:30 PM -- 09:50 PM (KST) @ A1
The Mirage of Action-Dependent Baselines in Reinforcement Learning
Oral
Thu Jul 12 09:50 PM -- 10:10 PM (KST) @ A1
Smoothed Action Value Functions for Learning Gaussian Policies
Oral
Thu Jul 12 10:10 PM -- 10:20 PM (KST) @ A1
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Oral
Thu Jul 12 10:20 PM -- 10:30 PM (KST) @ A1
Addressing Function Approximation Error in Actor-Critic Methods