(10 events)
Oral
Wed Jun 12 03:00 AM -- 03:20 AM (KST) @ Hall B
ELF OpenGo: an analysis and open reimplementation of AlphaZero
Yuandong Tian · Jerry Ma · Qucheng Gong · Shubho Sengupta · Zhuoyuan Chen · James Pinkerton · Larry Zitnick
Oral
Wed Jun 12 03:20 AM -- 03:25 AM (KST) @ Hall B
Making Deep Q-learning methods robust to time discretization
Corentin Tallec · Leonard Blier · Yann Ollivier
Oral
Wed Jun 12 03:25 AM -- 03:30 AM (KST) @ Hall B
Nonlinear Distributional Gradient Temporal-Difference Learning
Chao Qu · Shie Mannor · Huan Xu
Oral
Wed Jun 12 03:30 AM -- 03:35 AM (KST) @ Hall B
Composing Entropic Policies using Divergence Correction
Jonathan Hunt · Andre Barreto · Timothy Lillicrap · Nicolas Heess
Oral
Wed Jun 12 03:35 AM -- 03:40 AM (KST) @ Hall B
TibGM: A Transferable and Information-Based Graphical Model Approach for Reinforcement Learning
Tameem Adel · Adrian Weller
Oral
Wed Jun 12 03:40 AM -- 04:00 AM (KST) @ Hall B
Multi-Agent Adversarial Inverse Reinforcement Learning
Lantao Yu · Jiaming Song · Stefano Ermon
Oral
Wed Jun 12 04:00 AM -- 04:05 AM (KST) @ Hall B
Policy Consolidation for Continual Reinforcement Learning
Christos Kaplanis · Murray Shanahan · Claudia Clopath
Oral
Wed Jun 12 04:05 AM -- 04:10 AM (KST) @ Hall B
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto · David Meger · Doina Precup
Oral
Wed Jun 12 04:10 AM -- 04:15 AM (KST) @ Hall B
Random Expert Distillation: Imitation Learning via Expert Policy Support Estimation
Ruohan Wang · Carlo Ciliberto · Pierluigi Vito Amadori · Yiannis Demiris
Oral
Wed Jun 12 04:15 AM -- 04:20 AM (KST) @ Hall B
Revisiting the Softmax Bellman Operator: New Benefits and New Perspective
Zhao Song · Ron Parr · Lawrence Carin