Poster

Optimal Transport for Reward Modeling from Noisy Feedback

Eric Wang ⋅ Licheng Pan ⋅ Haocheng Yang ⋅ Yunsheng Lu ⋅ Yongqi Tong ⋅ Yinuo Wang ⋅ Shijian Wang ⋅ Zhixuan Chu ⋅ Lei Shen ⋅ Haoxuan Li ⋅ Yuan Lu

Abstract
