The Limits of Preferences: Navigating Human-AI Feedback Tradeoffs in Alignment

Valentina Pyatkin

Abstract

