Skip to yearly menu bar Skip to main content


Failure Modes of Learning Reward Models for LLMs and other Sequence Models

Silviu Pitis

Abstract

Video

Chat is not available.