Skip to yearly menu bar Skip to main content


Poster

DPO Unchained: Your Training Algorithm is Secretly Disentangled in Human Choice Theory (and Its Loss' Convexity is Dispensable)

Wenxuan Zhou ⋅ Shujian Zhang ⋅ brice magdalou ⋅ John Lambert ⋅ Ehsan Amid ⋅ Richard Nock ⋅ Andrew Hard

Abstract

Log in and register to view live content