

Poster

Normalized Rewards for Preference Optimization

Shawn Im ⋅ Federico Danieli ⋅ Skyler Seto ⋅ Barry-John Theobald ⋅ Katherine Metcalf

Abstract
