9e Journée Statistique et Informatique pour la Science des Données à Paris-Saclay

Name: 9e Journée Statistique et Informatique pour la Science des Données à Paris-Saclay
Start: 2024-04-03T09:00:00+02:00
End: 2024-04-03T17:30:00+02:00
Location: Le Bois-Marie

April 3, 2024

Time zone: Europe/Paris

Cécile Gourgues

Learning with Missing Values: Theoretical Insights and Application to Health Databases

Apr 3, 2024, 11:20

50 min

Centre de Conférences Marilyn et James Simons (Le Bois-Marie)

35, route de Chartres CS 40001 91893 Bures-sur-Yvette Cedex

Marine Le Morvan (INRIA, Saclay)

Missing values are ubiquitous in many fields such as health, business or social sciences. To date, much of the literature on missing values has focused on imputation as well as inference with incomplete data. In contrast, supervised learning in the presence of missing values has received little attention. In this talk I will explain the challenges posed by missing values in regression and classification tasks. In practice, a common solution consists in imputing the missing values prior to learning. I will show how different baseline methods for handling missing values compare on several large health databases with naturally occurring missing values. We will then examine the theoretical foundations of Impute-then-Regress approaches. Finally, I will present a neural network architecture for learning with missing values that goes beyond the two-stage Impute-then-Regress approaches.
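As a rough illustration of the two-stage Impute-then-Regress idea mentioned in the abstract, here is a minimal sketch using only the Python standard library. It is not the speaker's method or code: the helper names `mean_impute` and `fit_line`, the choice of mean imputation, and the toy data are all assumptions made for this example.

```python
from statistics import fmean


def mean_impute(column):
    """Replace None entries with the mean of the observed entries (hypothetical helper)."""
    observed = [v for v in column if v is not None]
    fill = fmean(observed)
    return [fill if v is None else v for v in column]


def fit_line(x, y):
    """Ordinary least squares for y ~ slope * x + intercept, single feature (hypothetical helper)."""
    mx, my = fmean(x), fmean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx


# Toy data, made up for illustration: y = 2 * x + 1, with two x values missing.
x_raw = [0.0, 1.0, None, 3.0, None, 5.0]
y = [1.0, 3.0, 5.0, 7.0, 9.0, 11.0]

x_imputed = mean_impute(x_raw)             # stage 1: impute
slope, intercept = fit_line(x_imputed, y)  # stage 2: regress on the completed data
print(slope, intercept)
```

On this toy data the fitted slope is 2.0 but the intercept comes out as 1.5 rather than 1, a small reminder that the regression is fit on imputed values rather than on the true ones.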
