Timezone: »
Oral
On Medians of (Randomized) Pairwise Means
Stephan Clemencon · Pierre Laforgue · Patrice Bertail
Tournament procedures, recently introduced in \cite{lugosi2016risk}, offer an appealing alternative, from a theoretical perspective at least, to the principle of \textit{Empirical Risk Minimization} in machine learning. Statistical learning by Median-of-Means (MoM) basically consists in segmenting the training data into blocks of equal size and comparing the statistical performance of every pair of candidate decision rules on each data block: that with highest performance on the majority of the blocks is declared as the winner. In the context of nonparametric regression, functions having won all their duels have been shown to outperform empirical risk minimizers w.r.t. the mean squared error under minimal assumptions, while exhibiting robustness properties. It is the purpose of this paper to extend this approach, in order to address other learning problems in particular, for which the performance criterion takes the form of an expectation over pairs of observations rather than over one single observation, as may be the case in pairwise ranking, clustering or metric learning. Precisely, it is proved here that the bounds achieved by MoM are essentially conserved when the blocks are built by means of independent sampling without replacement schemes instead of a simple segmentation. These results are next extended to situations where the risk is related to a pairwise loss function and its empirical counterpart is of the form of a $U$-statistic. Beyond theoretical results guaranteeing the performance of the learning/estimation methods proposed, some numerical experiments provide empirical evidence of their relevance in practice.
Author Information
Stephan Clemencon (Telecom ParisTech)
Pierre Laforgue (Télécom ParisTech)
Patrice Bertail (Université Paris Nanterre)
Related Events (a corresponding poster, oral, or spotlight)
-
2019 Poster: On Medians of (Randomized) Pairwise Means »
Thu. Jun 13th 01:30 -- 04:00 AM Room Pacific Ballroom #200
More from the Same Authors
-
2023 Poster: Robust Consensus in Ranking Data Analysis: Definitions, Properties and Computational Issues »
Morgane Goibert · Clément Calauzènes · Ekhine IRUROZKI · Stephan Clemencon -
2022 Poster: Mitigating Gender Bias in Face Recognition using the von Mises-Fisher Mixture Model »
Jean-Rémy Conti · Nathan NOIRY · Stephan Clemencon · Vincent Despiegel · Stéphane Gentric -
2022 Spotlight: Mitigating Gender Bias in Face Recognition using the von Mises-Fisher Mixture Model »
Jean-Rémy Conti · Nathan NOIRY · Stephan Clemencon · Vincent Despiegel · Stéphane Gentric -
2021 Poster: Learning from Biased Data: A Semi-Parametric Approach »
Patrice Bertail · Stephan Clémençon · Yannick Guyonvarch · Nathan NOIRY -
2021 Spotlight: Learning from Biased Data: A Semi-Parametric Approach »
Patrice Bertail · Stephan Clémençon · Yannick Guyonvarch · Nathan NOIRY -
2020 Poster: Duality in RKHSs with Infinite Dimensional Outputs: Application to Robust Losses »
Pierre Laforgue · Alex Lambert · Luc Brogat-Motte · Florence d'Alche-Buc