Skip to yearly menu bar Skip to main content


ActiveDPO: Active Direct Preference Optimization for Sample-Efficient Alignment

Xiaoqiang Lin · Arun Verma · Zhongxiang Dai · Daniela Rus · See-Kiong Ng · Bryan Kian Hsiang Low

Abstract

Chat is not available.