

Reward Collapse in Aligning Large Language Models: A Prompt-Aware Approach to Preference Rankings

Ziang Song ⋅ Tianle Cai ⋅ Jason Lee ⋅ Weijie Su

Abstract

