Skip to yearly menu bar Skip to main content


A functional mirror ascent view of policy gradient methods with function approximation

Sharan Vaswani ⋅ Olivier Bachem ⋅ Simone Totaro ⋅ Matthieu Geist ⋅ Marlos C. Machado ⋅ Pablo Samuel Castro ⋅ Nicolas Le Roux

Abstract

Chat is not available.