New Desiderata for Direct Preference Optimization

Xiangkun Hu ⋅ Tong He ⋅ David Wipf

Abstract
