Skip to yearly menu bar Skip to main content


Poster

Faults in Our Formal Benchmarking: Dataset Defects and Evaluation Failures in Lean Theorem Proving

Pawan Sasanka Ammanamanchi ⋅ Siddharth Bhat ⋅ Stella Biderman

Abstract

Log in and register to view live content