Skip to yearly menu bar Skip to main content


Reward Collapse in Aligning Large Language Models: A Prompt-Aware Approach to Preference Rankings

Ziang Song · Tianle Cai · Jason Lee · Weijie Su

Abstract

Video

Chat is not available.