Skip to yearly menu bar Skip to main content


Poster

When Distance Distracts: Representation Distance Bias in BT-Loss for Reward Models

Tong Xie ⋅ Ching-Yuan Bai ⋅ Yuanhao Ban ⋅ Yunqi Hong ⋅ Haoyu Li ⋅ Cho-Jui Hsieh

Abstract

Log in and register to view live content