Skip to yearly menu bar Skip to main content


Reinforcement Learning from Human Text Feedback: Learning a Reward Model from Human Text Input

Belen Martin Urcelay · Andreas Krause · Giorgia Ramponi

Abstract

Video

Chat is not available.