Tutorial
Online and non-stochastic control
Elad Hazan · Karan Singh
Virtual
In recent years new methods have emerged in control and reinforcement learning that incorporate techniques from regret minimization and online convex optimization. The resulting theory give rise to provable guarantees for some longstanding questions in control and reinforcement learning: logarithmic regret and fast rates, end-to-end LQG-LQR without system knowledge, Kalman filtering with adversarial noise, black-box control with provable finite-time guarantees, tight lower bounds for system identification, and more.
The main innovation in these results stems from an online control model which replaces stochastic perturbations by adversarial ones, and the goal of optimal control with regret minimization. We will describe the setting, as well as novel methods that are gradient-based and rely on novel convex relaxations.
Schedule
Mon 12:00 p.m. - 1:15 p.m.
|
Online and non-stochastic control
(
Talk 1
)
>
SlidesLive Video |
Elad Hazan 🔗 |
Mon 1:15 p.m. - 1:45 p.m.
|
Online and non-stochastic control
|
🔗 |
Mon 1:45 p.m. - 3:00 p.m.
|
Online and non-stochastic control
(
Talk 2
)
>
SlidesLive Video |
Karan Singh 🔗 |