

Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms

Rafael Rafailov ⋅ Yaswanth Chittepu ⋅ Ryan Park ⋅ Harshit Sikchi ⋅ Joey Hejna ⋅ William Knox ⋅ Chelsea Finn ⋅ Scott Niekum

