Skip to yearly menu bar Skip to main content


Poster

Reward Modeling from Natural Language Human Feedback

Zongqi Wang ⋅ Rui Wang ⋅ Yuchuan Wu ⋅ Yiyao Yu ⋅ Pinyi Zhang ⋅ Shaoning Sun ⋅ Yujiu Yang ⋅ Yongbin Li

Abstract

Log in and register to view live content