Poster

CyberCycle: Scalable Real-World Benchmark for AI Agents' End-to-End Cybersecurity Capabilities

Tianneng Shi ⋅ Robin Rheem ⋅ Dongwei Jiang ⋅ Francisco De La Riega ⋅ Mona Wang ⋅ Zhun Wang ⋅ Jingzhi Jiang ⋅ Alexander Cheung ⋅ Sean Tai ⋅ Jonah Cha ⋅ Jianhong Tu ⋅ Gabriel Han ⋅ Chenguang Wang ⋅ Wenbo Guo ⋅ Jingxuan He ⋅ Dawn Song

Abstract
