Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment

Zhaofeng Wu ⋅ Ananth Balashankar ⋅ Yoon Kim ⋅ Jacob Eisenstein ⋅ Ahmad Beirami

Abstract

