Skip to yearly menu bar Skip to main content


Oral

Stop Regressing: Training Value Functions via Classification for Scalable Deep RL

Jesse Farebrother · Jordi Orbay · Quan Vuong · Adrien Ali Taiga · Yevgen Chebotar · Ted Xiao · Alexander Irpan · Sergey Levine · Pablo Samuel Castro · Aleksandra Faust · Aviral Kumar · Rishabh Agarwal
2024 Oral

Abstract

Video

Chat is not available.