

Invited Talk

Proxy objectives in reinforcement learning from human feedback

John Schulman
2023 Invited Talk

Abstract

Speaker

John Schulman

John leads a team working on ChatGPT and RL from human feedback at OpenAI, which he cofounded. His recent published work includes combining language models with retrieval (WebGPT) and scaling laws for RL and alignment. Earlier, he developed some of the foundational methods of deep RL (TRPO, PPO). Before OpenAI, John earned a PhD from UC Berkeley, where he was advised by Pieter Abbeel. In his free time, he enjoys running, jazz piano, and raising chickens.
