Skip to yearly menu bar Skip to main content


Oral

VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data

Thomas Zeng ⋅ Shuibai Zhang ⋅ Shutong Wu ⋅ Christian Classen ⋅ Daewon Chae ⋅ Ethan Ewer ⋅ Minjae Lee ⋅ Heeju Kim ⋅ Wonjun Kang ⋅ Jackson Kunde ⋅ Ying Fan ⋅ Jungtaek Kim ⋅ HYUNG IL KOO ⋅ Kannan Ramchandran ⋅ Dimitris Papailiopoulos ⋅ Kangwook Lee
2025 Oral

Abstract

Lay Summary

Video

Chat is not available.