5   Show all »
Toggle Poster Visibility
Oral
Wed Jul 11th 11:00 -- 11:20 AM @ A1
Problem Dependent Reinforcement Learning Bounds Which Can Identify Bandit Structure in MDPs
Andrea Zanette · Emma Brunskill
Oral
Wed Jul 11th 11:20 -- 11:30 AM @ A1
Learning with Abandonment
Sven Schmit · Ramesh Johari
Oral
Wed Jul 11th 11:30 -- 11:40 AM @ A1
Lipschitz Continuity in Model-based Reinforcement Learning
Kavosh Asadi · Dipendra Misra · Michael L. Littman
Oral
Wed Jul 11th 11:40 -- 11:50 AM @ A1
Implicit Quantile Networks for Distributional Reinforcement Learning
Will Dabney · Georg Ostrovski · David Silver · Remi Munos
Oral
Wed Jul 11th 11:50 AM -- 12:00 PM @ A1
More Robust Doubly Robust Off-policy Evaluation
Mehrdad Farajtabar · Yinlam Chow · Mohammad Ghavamzadeh