Skip to yearly menu bar Skip to main content


Poster

CentaurEval: Benchmarking Human-in-the-Loop Value in Agentic Coding

Hanjun Luo ⋅ Chiming Ni ⋅ Jiaheng Wen ⋅ Zhimu Huang ⋅ Bingduo Liao ⋅ Yiran Wang ⋅ Sylvia Chung Yan Shan ⋅ Yingbin Jin ⋅ Jialin Li ⋅ Xinfeng Li ⋅ Wenyuan Xu ⋅ XiaoFeng Wang ⋅ Hanan Salam

Abstract

Log in and register to view live content