Skip to yearly menu bar Skip to main content


Poster Wed, Jul 8, 2026 • 1:00 AM – 2:45 AM PDT HALL A #3801

Removing Noise, not Finding Gold: Quality Filtering for Large-Scale Pretraining

Thiziri Nait Saada ⋅ Louis Béthune ⋅ Michal Klein ⋅ David Grangier ⋅ Marco Cuturi ⋅ Pierre Ablin

Abstract

Log in and register to view live content