(5 events)   Timezone: »  
Show all »
Toggle Poster Visibility
Oral
Wed Jul 11 02:00 AM -- 02:20 AM (PDT) @ A1
Problem Dependent Reinforcement Learning Bounds Which Can Identify Bandit Structure in MDPs
Andrea Zanette · Emma Brunskill
Oral
Wed Jul 11 02:20 AM -- 02:30 AM (PDT) @ A1
Learning with Abandonment
Sven Schmit · Ramesh Johari
Oral
Wed Jul 11 02:30 AM -- 02:40 AM (PDT) @ A1
Lipschitz Continuity in Model-based Reinforcement Learning
Kavosh Asadi · Dipendra Misra · Michael L. Littman
Oral
Wed Jul 11 02:40 AM -- 02:50 AM (PDT) @ A1
Implicit Quantile Networks for Distributional Reinforcement Learning
Will Dabney · Georg Ostrovski · David Silver · Remi Munos
Oral
Wed Jul 11 02:50 AM -- 03:00 AM (PDT) @ A1
More Robust Doubly Robust Off-policy Evaluation
Mehrdad Farajtabar · Yinlam Chow · Mohammad Ghavamzadeh