Oral
Wed Jul 11 08:00 AM -- 08:20 AM (PDT) @ A1
Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning
Oral
Wed Jul 11 08:20 AM -- 08:40 AM (PDT) @ A1
Path Consistency Learning in Tsallis Entropy Regularized MDPs
Oral
Wed Jul 11 08:40 AM -- 08:50 AM (PDT) @ A1
Improved Regret Bounds for Thompson Sampling in Linear Quadratic Control Problems
Oral
Wed Jul 11 08:50 AM -- 09:00 AM (PDT) @ A1
Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator