Skip to yearly menu bar Skip to main content


Poster

CUARewardBench: Benchmark for Evaluating Reward Models on Computer-using Agent Trajectories

Haojia Lin ⋅ Xiaoyu Tan ⋅ Yulei Qin ⋅ Zihan Xu ⋅ Yuchen Shi ⋅ Zongyi Li ⋅ Gang Li ⋅ Shaofei Cai ⋅ Siqi Cai ⋅ Yuzheng Cai ⋅ Chaoyou Fu ⋅ Ke Li ⋅ Xing Sun

Abstract

Log in and register to view live content