in
Workshop: 2nd AI for Math Workshop @ ICML 2025

AIMO Remarks - AI4Math in 2025: Closing Gaps, Exposing Fault Lines

Simon Frieder

2025

Abstract

This talk surveys recent progress in the AI4Math field, focusing on the top-performing submissions from the 2025 AI Mathematical Olympiad (AIMO), one of Kaggle’s largest competitions to date, and on the broader evolution of the field beyond Olympiad-style problem solving. We present two key findings. First, where consensus exists on which benchmark tasks should be used to track performance, open-source models are rapidly closing the gap with proprietary systems. Second, in areas where benchmark consensus is lacking, which reflects the sparsity of evaluation standards in AI4Math, both open-source and proprietary LLMs exhibit comparable limitations.
