Skip to yearly menu bar Skip to main content


Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms

Rafael Rafailov · Yaswanth Chittepu · Ryan Park · Harshit Sikchi · Joey Hejna · William Knox · Chelsea Finn · Scott Niekum

Abstract

Video

Chat is not available.