Oral
Thu Jul 12 02:00 AM -- 02:20 AM (PDT) @ A1
Convergent Tree Backup and Retrace with Function Approximation
Ahmed Touati · Pierre-Luc Bacon · Doina Precup · Pascal Vincent
Oral
Thu Jul 12 02:20 AM -- 02:40 AM (PDT) @ A1
SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Bo Dai · Albert Shaw · Lihong Li · Lin Xiao · Niao He · Zhen Liu · Jianshu Chen · Le Song
Oral
Thu Jul 12 02:40 AM -- 02:50 AM (PDT) @ A1
Scalable Bilinear Pi Learning Using State and Action Features
Yichen Chen · Lihong Li · Mengdi Wang
Oral
Thu Jul 12 02:50 AM -- 03:00 AM (PDT) @ A1
Stochastic Variance-Reduced Policy Gradient
Matteo Papini · Damiano Binaghi · Giuseppe Canonaco · Matteo Pirotta · Marcello Restelli