Skip to yearly menu bar Skip to main content


Finite time analysis of temporal difference learning with linear function approximation: the tail averaged case

Gandharv Patil ⋅ Prashanth L.A. ⋅ Doina Precup

Abstract

Chat is not available.