Poster
Learning Linear-Quadratic Regulators Efficiently with only $\sqrt{T}$ Regret
Alon Cohen · Tomer Koren · Yishay Mansour
Pacific Ballroom #159
Keywords: [ Bandits ] [ Online Learning ] [ Theory and Algorithms ]
[
Abstract
]
Abstract:
We present the first computationally-efficient algorithm with $\widetilde{O}(\sqrt{T})$ regret for learning in Linear Quadratic Control systems with unknown dynamics.
By that, we resolve an open question of Abbasi-Yadkori and Szepesvari (2011) and Dean,Mania, Matni, Recht, and Tu (2018).
Live content is unavailable. Log in and register to view live content