

ActiveDPO: Active Direct Preference Optimization for Sample-Efficient Alignment

Xiaoqiang Lin ⋅ Arun Verma ⋅ Zhongxiang Dai ⋅ Daniela Rus ⋅ See-Kiong Ng ⋅ Bryan Kian Hsiang Low

Abstract
