Skip to yearly menu bar Skip to main content


New Desiderata for Direct Preference Optimization

Xiangkun Hu · Tong He · David Wipf

Abstract

Video

Chat is not available.