Skip to yearly menu bar Skip to main content


Tutorial

Proving Theorems with Lean and Machine Learning

Rémy Degenne ⋅ Wenda Li
Jul 6, 9:00 AM - 11:30 AM HALL B2
View full details
Tutorial

Probabilistic Numerics — Computation is Machine Learning

Philipp Hennig ⋅ Marvin Pförtner ⋅ Tim Weiland
Jul 6, 9:00 AM - 11:30 AM HALL D2
View full details
Tutorial

Unlearning Data at Scale

Vinith Suriyakumar ⋅ Gautam Kamath ⋅ Ashia Wilson
Jul 6, 9:00 AM - 11:30 AM AUDITORIUM
View full details
Tutorial

Diffusion and Flow-Matching: From Memorization to Generalization & Beyond

Mathurin Massias ⋅ Quentin Bertrand
Jul 6, 9:00 AM - 11:30 AM HALL D1
View full details
Tutorial

Is numerical optimization theory irrelevant to machine learning practice in 2026?

Mark Schmidt
Jul 6, 1:30 PM - 4:00 PM HALL D1
We are seeing more numerical optimization theory papers published than ever before. These papers often make unrealistic assumptions or propose algorithms that never get adopted. So is all this optimization theory largely useless?

In this tutorial I show how some surprisingly simple optimization ideas can explain a wide variety of the implementation choices we make when training modern deep learning models. Some of these ideas might have let us skip some generations of grad-student descent, or have led to state-of-the-art tricks in modern architectures. On the other hand, I will highlight how some important practical ideas are not explained by optimization theory and where we can go from here.

Here is a list of keywords to get you (and your LLM sidekick) interested in attending: Adam and [*]A[*]d[*]a[*]m[*], Muon and its friends/enemies, critical-ish batch size, the RMSnorm and skip connection love affair, dead ReLUs and living SwiGLU, Schedule-Free and WSD and muP and max\_grad\_norm = 1.0, variance reduction and shuffle=True, and maybe edge-of-stability/catapults/feature-learning. I may also tell you why your second-order stochastic optimization method did not work.
Show more
View full details
Tutorial

Adaptive Reasoning in LLMs: From Post-Training to Test-Time Learning

Akhil Arora ⋅ Nouha Dziri
Jul 6, 1:30 PM - 4:00 PM HALL C
View full details
Tutorial

Evaluating and Training LLMs for Math Copilots and Theorem Proving

Simon Frieder ⋅ Philip Vonderlind
Jul 6, 1:30 PM - 4:00 PM HALL B2
View full details
Tutorial

Calibration: From Predictions to Decisions, Collaboration, and Alignment

Aaron Roth ⋅ Collina ⋅ Ira Globus-Harris
Jul 6, 1:30 PM - 4:00 PM AUDITORIUM
View full details
Tutorial

New Techniques for Sequence Prediction: Spectral Filtering and Preconditioning

Elad Hazan ⋅ Annie Marsden
Jul 6, 1:30 PM - 4:00 PM HALL D2
View full details