(4 events)
Oral
Thu Jul 12 02:00 AM -- 02:20 AM (PDT) @ A1
Convergent Tree Backup and Retrace with Function Approximation
Ahmed Touati · Pierre-Luc Bacon · Doina Precup · Pascal Vincent
Oral
Thu Jul 12 02:20 AM -- 02:40 AM (PDT) @ A1
SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Bo Dai · Albert Shaw · Lihong Li · Lin Xiao · Niao He · Zhen Liu · Jianshu Chen · Le Song
Oral
Thu Jul 12 02:40 AM -- 02:50 AM (PDT) @ A1
Scalable Bilinear Pi Learning Using State and Action Features
Yichen Chen · Lihong Li · Mengdi Wang
Oral
Thu Jul 12 02:50 AM -- 03:00 AM (PDT) @ A1
Stochastic Variance-Reduced Policy Gradient
Matteo Papini · Damiano Binaghi · Giuseppe Canonaco · Matteo Pirotta · Marcello Restelli