Skip to yearly menu bar Skip to main content


A functional mirror ascent view of policy gradient methods with function approximation

Sharan Vaswani · Olivier Bachem · Simone Totaro · Matthieu Geist · Marlos C. Machado · Pablo Samuel Castro · Nicolas Le Roux

Abstract

Chat is not available.