

Spotlight Poster

ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks

Saurabh Jha ⋅ Rohan Arora ⋅ Yuji Watanabe ⋅ Takumi Yanagawa ⋅ Yinfang Chen ⋅ Jackson Clark ⋅ Bhavya Bhavya ⋅ Mudit Verma ⋅ Harshit Kumar ⋅ Hirokuni Kitahara ⋅ Noah Zheutlin ⋅ Saki Takano ⋅ Divya Pathak ⋅ Felix George ⋅ Xinbo Wu ⋅ Bekir Turkkan ⋅ Gerard Vanloo ⋅ Michael Nidd ⋅ Ting Dai ⋅ Oishik Chatterjee ⋅ Pranjal Gupta ⋅ Suranjana Samanta ⋅ Pooja Aggarwal ⋅ Rong Lee ⋅ Jae-wook Ahn ⋅ Debanjana Kar ⋅ Amit Paradkar ⋅ Yu Deng ⋅ Pratibha Moogi ⋅ Prateeti Mohapatra ⋅ Naoki Abe ⋅ Chandrasekhar Narayanaswami ⋅ Tianyin Xu ⋅ Lav Varshney ⋅ Ruchi Mahindru ⋅ Anca Sailer ⋅ Laura Shwartz ⋅ Daby Sow ⋅ Nicholas Fuller ⋅ Ruchir Puri
2025 Spotlight Poster

Abstract

