Skip to yearly menu bar Skip to main content


Reinforcement Learning from Human Text Feedback: Learning a Reward Model from Human Text Input

Belen Martin Urcelay ⋅ Andreas Krause ⋅ Giorgia Ramponi

Abstract

Video

Chat is not available.