Skip to yearly menu bar Skip to main content


Alignment as Distribution Learning: Your Preference Model is Explicitly a Language Model

Jihun Yun · Juno Kim · Jongho Park · Junhyuck Kim · Jongha (Jon) Ryu · Jaewoong Cho · Kwang-Sung Jun

Abstract

Chat is not available.