Skip to yearly menu bar Skip to main content


Spotlight Poster

CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities

Yuxuan Zhu ⋅ Antony Kellermann ⋅ Dylan Bowman ⋅ Philip Li ⋅ Akul Gupta ⋅ Adarsh Danda ⋅ Richard Fang ⋅ Conner Jensen ⋅ Eric Ihli ⋅ Jason Benn ⋅ Jet Geronimo ⋅ Avi Dhir ⋅ Sudhit Rao ⋅ Kaicheng Yu ⋅ Twm Stone ⋅ Daniel Kang
2025 Spotlight Poster

Abstract

Lay Summary

Video

Chat is not available.