4   Show all »
Toggle Poster Visibility
Oral
Thu Jul 12th 02:30 -- 02:50 PM @ A1
The Mirage of Action-Dependent Baselines in Reinforcement Learning
George Tucker · Surya Bhupatiraju · Shixiang Gu · Richard E Turner · Zoubin Ghahramani · Sergey Levine
Oral
Thu Jul 12th 02:50 -- 03:10 PM @ A1
Smoothed Action Value Functions for Learning Gaussian Policies
Ofir Nachum · Mohammad Norouzi · George Tucker · Dale Schuurmans
Oral
Thu Jul 12th 03:10 -- 03:20 PM @ A1
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja · Aurick Zhou · Pieter Abbeel · Sergey Levine
Oral
Thu Jul 12th 03:20 -- 03:30 PM @ A1
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto · Herke van Hoof · David Meger