Skip to yearly menu bar Skip to main content


Poster

BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs

Ivo Petrov ⋅ Jasper Dekoninck ⋅ Martin Vechev

Abstract

Log in and register to view live content