4   Show all »
Toggle Poster Visibility
Oral
Thu Jul 12th 11:00 -- 11:20 AM @ A1
Convergent Tree Backup and Retrace with Function Approximation
Ahmed Touati · Pierre-Luc Bacon · Doina Precup · Pascal Vincent
Oral
Thu Jul 12th 11:20 -- 11:40 AM @ A1
SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Bo Dai · Albert Shaw · Lihong Li · Lin Xiao · Niao He · Zhen Liu · Jianshu Chen · Le Song
Oral
Thu Jul 12th 11:40 -- 11:50 AM @ A1
Scalable Bilinear Pi Learning Using State and Action Features
Yichen Chen · Lihong Li · Mengdi Wang
Oral
Thu Jul 12th 11:50 AM -- 12:00 PM @ A1
Stochastic Variance-Reduced Policy Gradient
Matteo Papini · Damiano Binaghi · Giuseppe Canonaco · Matteo Pirotta · Marcello Restelli