Skip to yearly menu bar Skip to main content


Poster

When AI Benchmarks Plateau: A Systematic Study of Benchmark Saturation

Mubashara Akhtar ⋅ Anka Reuel ⋅ Prajna Soni ⋅ Sanchit Ahuja ⋅ Pawan Sasanka Ammanamanchi ⋅ Ruchit Rawal ⋅ Vilém Zouhar ⋅ Srishti Yadav ⋅ Chenxi Whitehouse ⋅ Dayeon Ki ⋅ Jennifer Mickel ⋅ Leshem Choshen ⋅ Marek Šuppa ⋅ Jan Batzner ⋅ Jenny Chim ⋅ Jeba Sania ⋅ Yanan Long ⋅ Hossein A. Rahmani ⋅ Christina Knight ⋅ Yiyang Nan ⋅ Jyoutir Raj ⋅ Yu Fan ⋅ Shubham Singh ⋅ Subramanyam Sahoo ⋅ Eliya Habba ⋅ Usman Gohar ⋅ Siddhesh Pawar ⋅ Robert Scholz ⋅ Arjun Subramonian ⋅ Jingwei Ni ⋅ Mrinmaya Sachan ⋅ Mykel Kochenderfer ⋅ Sanmi Koyejo ⋅ Stella Biderman ⋅ Zeerak Talat ⋅ Avijit Ghosh ⋅ Irene Solaiman

Abstract

Log in and register to view live content