Skip to yearly menu bar Skip to main content


Oral

Trajectory-Based Off-Policy Deep Reinforcement Learning

Andreas Doerr · Michael Volpp · Marc Toussaint · Sebastian Trimpe · Christian Daniel
2019 Oral
[ Slides [ Video

Abstract

Chat is not available.