Skip to yearly menu bar Skip to main content


Poster
in
Workshop: CODEML: Championing Open-source DEvelopment in Machine Learning
Fri, Jul 18, 2025 • 2:15 PM – 3:00 PM PDT

A2Perf: Benchmarking Autonomous Agents End-to-End in Realistic Domains

Ikechukwu Uchendu · Jason Jabbour · Korneel Van den Berghe · Joel Runevic · Matthew Stewart · Jeffrey Ma · Srivatsan Krishnan · Izzeddin Gur · Austin Huang · Colton Bishop · Paige Bailey · Wenjie Jiang · Ebrahim M. Songhori · Sergio Guadarrama · Jie Tan · Jordan Terry · Aleksandra Faust · Vijay Janapa Reddi

Abstract

Chat is not available.