Skip to yearly menu bar Skip to main content


Spotlight Poster Thu, Jul 17, 2025 • 4:30 PM – 7:00 PM PDT

CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities

Yuxuan Zhu · Antony Kellermann · Dylan Bowman · Philip Li · Akul Gupta · Adarsh Danda · Richard Fang · Conner Jensen · Eric Ihli · Jason Benn · Jet Geronimo · Avi Dhir · Sudhit Rao · Kaicheng Yu · Twm Stone · Daniel Kang

Abstract

Lay Summary

Video

Chat is not available.