DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction
Aviral Kumar
Video: http://slideslive.com/38931343