Skip to yearly menu bar Skip to main content


REBEL: Reinforcement Learning via Regressing Relative Rewards

Zhaolin Gao · Jonathan Chang · Wenhao Zhan · Owen Oertell · Gokul Swamy · Kianté Brantley · Thorsten Joachims · Drew Bagnell · Jason Lee · Wen Sun

Abstract

Video

Chat is not available.