Skip to yearly menu bar Skip to main content


Oral
in
Workshop: 2nd Workshop on Models of Human Feedback for AI Alignment (MoFA)
Fri, Jul 18, 2025 • 3:45 PM – 4:00 PM PDT

Dropping Just a Handful of Preferences Can Change Top Large Language Model Rankings

Jenny Huang · Yunyi Shen · Dennis Wei · Tamara Broderick

Abstract

Video

Chat is not available.