Skip to yearly menu bar Skip to main content


Poster

Reward Auditor: Inference on Reward Modeling Suitability in Real-World Perturbed Scenarios

Jianxiang Zang ⋅ Yongda Wei ⋅ Ruxue Bai ⋅ Shiyu Jiang ⋅ Nijia Mo ⋅ Binhong Li ⋅ Qiang Sun ⋅ Hui Liu

Abstract

Log in and register to view live content