Skip to yearly menu bar Skip to main content


Theoretical Analysis of KL-regularized RLHF with Multiple Reference Models

Gholamali Aminian ⋅ Amir R. Asadi ⋅ Idan Shenfeld ⋅ Youssef Mroueh

Abstract

Chat is not available.