Skip to yearly menu bar Skip to main content


Poster

SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?

Xiang Deng ⋅ Jeff Da ⋅ Edwin Pan ⋅ Yannis Yiming He ⋅ Charles Ide ⋅ Kanak Garg ⋅ Niklas Lauffer ⋅ Andrew Park ⋅ Chetan Rane ⋅ Karmini Sampath ⋅ Maya Krishnan ⋅ Srivatsa Kundurthy ⋅ Sean Hendryx ⋅ Zifan Wang ⋅ Chen Bo Calvin Zhang ⋅ Noah Jacobson ⋅ Bing Liu ⋅ Brad Kenstler

Abstract

Log in and register to view live content