Skip to yearly menu bar Skip to main content


Robust Reward Modeling via Causal Rubrics

Pragya Srivastava ⋅ Harman Singh ⋅ Rahul Madhavan ⋅ Gandharv Patil ⋅ Sravanti Addepalli ⋅ Arun Sai Suggala ⋅ Rengarajan Aravamudhan ⋅ Soumya Sharma ⋅ Anirban Laha ⋅ Aravindan Raghuveer ⋅ Karthikeyan Shanmugam ⋅ Doina Precup

Abstract

Chat is not available.