Skip to yearly menu bar Skip to main content


DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction

Aviral Kumar

Abstract

Video

Chat is not available.