Learning with Missing Values

Workshop

Learning with Missing Values

Julie Josse · Jes Frellsen · Pierre-Alexandre Mattei · Gael Varoquaux

[ Abstract ] Workshop Website

[ Project Page ]

Analysis of large amounts of data offers new opportunities to understand many processes better. Yet, data accumulation often implies relaxing acquisition procedures or compounding diverse sources, leading to many observations with missing features. From questionnaires to collaborative filtering, from electronic health records to single-cell analysis, missingness is everywhere at play and is rather the norm than the exception. Even “clean” data sets are often barely “cleaned” versions of incomplete data sets—with all the unfortunate biases this cleaning process may have created.

Despite this ubiquity, tackling missing values is often overlooked. Handling missing values poses many challenges, and there is a vast literature in the statistical community, with many implementations available. Yet, there are still many open issues and the need to design new methods or to introduce new point of views: for missing values in a supervised-learning setting, in deep learning architectures, to adapt available methods for high dimensional observed data with different type of missing values, deal with feature mismatch and distribution mismatch. Missing data is one of the eight pillars of causal wisdom for Judea Pearl who brought graphical model reasoning to tackle some missing not at random values.

To the best of our knowledge, this is the first workshop at the major machine learning conferences focusing primarily on missing value problems in recent years. The goal of our workshop is to give more momentum and exposition to research on missing values, both theoretical and methodological, and emphasize the connections with other areas of machine learning (e.g. causal inference, generative modelling, uncertainty quantification, transfer learning, distributional shift, etc.). We will also attach importance to discussing the reproducibility problems that can be caused by missing data, the danger of forgetting the missing values issues and the importance of providing sound implementations.

We welcome both academic and industrial practitioners/researchers. In particular, since missing data is a critical issue in many applications, we would like to federate industrial/applied know-how and various academic approaches.

Chat is not available.

Timezone: America/Los_Angeles

Schedule

Fri 1:45 a.m. - 2:00 a.m.	Opening Session ( Discussion ) >	Julie Josse · Jes Frellsen · Pierre-Alexandre Mattei · Gael Varoquaux 🔗
Fri 2:00 a.m. - 3:00 a.m.	Poster session 1 ( Posters ) >	🔗
Fri 4:30 a.m. - 5:10 a.m.	Invited Talk: Learning despite the unknown - missing data imputation in healthcare ( Talk ) > link SlidesLive Video Link	Mihaela van der Schaar 🔗
Fri 5:10 a.m. - 5:50 a.m.	Invited Talk: Imputing Missing Data with the Gaussian Copula ( Talk ) > link Link	Madeleine Udell 🔗
Fri 5:50 a.m. - 6:30 a.m.	Discussion and Q&A by Gael Varoquaux, Julie Josse and Pierre Alexandre Mattei ( Discussion Panel ) >	🔗
Fri 6:30 a.m. - 7:10 a.m.	Invited Talk: Efficient Missing-value Acquisition with Variational Autoencoders ( Talk ) > link SlidesLive Video Link	Jose Miguel Hernandez-Lobato 🔗
Fri 7:10 a.m. - 7:50 a.m.	Invited Talk: What Interpretable Machine Learning Can Tell Us About Missing Values ( Talk ) > link SlidesLive Video Link	Rich Caruana 🔗
Fri 7:50 a.m. - 8:30 a.m.	Discussion and Q&A by Gael Varoquaux and Jes Frellsen ( Discussion ) >	🔗
Fri 8:30 a.m. - 9:10 a.m.	Poster session 2 ( Poster ) >	🔗
Fri 9:10 a.m. - 9:50 a.m.	Invited Talk: Graphical Models based Solutions for Missing Data Problems. ( Talk Live ) > link Link	Karthika Mohan 🔗
Fri 9:50 a.m. - 10:30 a.m.	Invited Talk: Sequentially additive nonignorable missing data modelling using auxiliary marginal information ( Talk Live ) >	Mauricio Sadinle 🔗
Fri 10:30 a.m. - 11:10 a.m.	Discussion and Q&A by Ilya Shpitser - Identifiability of the full law in graphical missing data models ( Discussion Panel ) >	🔗
Fri 11:10 a.m. -	Informal gathering with drinks to celebrate ( Discussion Panel ) >	🔗