Skip to yearly menu bar Skip to main content


Self-Exploring Language Models: Active Preference Elicitation for Online Alignment

Shenao Zhang · Donghan Yu · Hiteshi Sharma · Ziyi Yang · Shuohang Wang · Hany Hassan Awadalla · Zhaoran Wang

Abstract

Chat is not available.