

Invited Talk

Proxy objectives in reinforcement learning from human feedback

John Schulman
2023 Invited Talk

