ICML Poster Learning Linear-Quadratic Regulators Efficiently with only $\sqrt{T}$ Regret

Poster

Learning Linear-Quadratic Regulators Efficiently with only $\sqrt{T}$ Regret

Alon Cohen · Tomer Koren · Yishay Mansour

Pacific Ballroom #159

Keywords: [ Bandits ] [ Online Learning ] [ Theory and Algorithms ]

[ Abstract ]

Abstract: We present the first computationally-efficient algorithm with

\tilde{O} (\sqrt{T})

$\widetilde{O}(\sqrt{T})$ regret for learning in Linear Quadratic Control systems with unknown dynamics. By that, we resolve an open question of Abbasi-Yadkori and Szepesvari (2011) and Dean,Mania, Matni, Recht, and Tu (2018).

Live content is unavailable. Log in and register to view live content

Poster

Learning Linear-Quadratic Regulators Efficiently with only √TT\sqrt{T} Regret

Alon Cohen · Tomer Koren · Yishay Mansour

Pacific Ballroom #159

Learning Linear-Quadratic Regulators Efficiently with only $\sqrt{T}$ Regret