Skip to yearly menu bar Skip to main content


(4 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Thu Jul 12 12:00 AM -- 12:20 AM (KST) @ A1
Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning
Ronan Fruit · Matteo Pirotta · Alessandro Lazaric · Ronald Ortner
[ PDF [ Video
Oral
Thu Jul 12 12:20 AM -- 12:40 AM (KST) @ A1
Path Consistency Learning in Tsallis Entropy Regularized MDPs
Yinlam Chow · Ofir Nachum · Mohammad Ghavamzadeh
[ PDF [ Video
Oral
Thu Jul 12 12:40 AM -- 12:50 AM (KST) @ A1
Improved Regret Bounds for Thompson Sampling in Linear Quadratic Control Problems
Marc Abeille · Alessandro Lazaric
[ PDF [ Video
Oral
Thu Jul 12 12:50 AM -- 01:00 AM (KST) @ A1
Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator
Stephen Tu · Benjamin Recht
[ PDF [ Video