Skip to yearly menu bar Skip to main content


Tutorial

Reinforcement Learning from Human Feedback: A Tutorial *

Dmitry Ustalov · Nathan Lambert
2023 Tutorial

Video

Chat is not available.