Skip to yearly menu bar Skip to main content


Poster

PaperBench: Evaluating AI’s Ability to Replicate AI Research

Giulio Starace ⋅ Oliver Jaffe ⋅ Dane Sherburn ⋅ James Aung ⋅ Jun Shern Chan ⋅ Leon Maksin ⋅ Rachel Dias ⋅ Evan Mays ⋅ Benjamin Kinsella ⋅ Wyatt Thompson ⋅ Johannes Heidecke ⋅ Amelia Glaese ⋅ Tejal Patwardhan
2025 Poster

Abstract

Lay Summary

Video

Chat is not available.