Tutorial

Reinforcement Learning from Human Feedback: A Tutorial *

Dmitry Ustalov · Nathan Lambert

Ballroom B
[ Project Page ]
[ Slides
Mon 24 Jul 12:30 p.m. PDT — 3 p.m. PDT

Abstract:

Chat is not available.