Debiasing Stochastic Gradient Descent to handle missing values - INRIA - Institut National de Recherche en Informatique et en Automatique
Preprint, Working Paper. Year: 2020

Debiasing Stochastic Gradient Descent to handle missing values

Abstract

A major caveat of large-scale data is their incompleteness. We propose an averaged stochastic gradient algorithm that handles missing values in linear models. This approach has the merit of requiring no modeling of the data distribution and of accounting for heterogeneous proportions of missing values. In both streaming and finite-sample settings, we prove that this algorithm achieves a convergence rate of O(1/n) at iteration n, the same as without missing values. We show the convergence behavior and the relevance of the algorithm not only on synthetic data but also on real data sets, including data collected from a medical register.
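The abstract does not spell out the algorithm, so the following is only a minimal sketch of how a debiased, averaged stochastic gradient step could look for least-squares regression. It assumes (none of this is stated in the abstract) that each feature j is observed independently with a known probability p[j], that missing entries are imputed by zero, and that a constant step size is used. The names `debiased_gradient`, `averaged_sgd_missing`, `mask`, `p`, and `step` are hypothetical and chosen for illustration.

```python
import numpy as np


def debiased_gradient(x_tilde, y, beta, p):
    """Gradient estimate for the loss 0.5 * (x @ beta - y)**2 from a zero-imputed row.

    Assumption (not from the abstract): feature j is observed independently
    with known probability p[j], and x_tilde holds 0.0 where x is missing.
    Under that assumption the expectation over the missingness pattern
    equals the complete-data gradient x * (x @ beta - y).
    """
    # Rescale observed entries by 1/p so cross terms are unbiased.
    residual = x_tilde @ (beta / p) - y
    main = (x_tilde / p) * residual
    # The squared terms x_j**2 are over-counted by the rescaling; subtract the excess.
    correction = (1.0 - p) / p**2 * x_tilde**2 * beta
    return main - correction


def averaged_sgd_missing(X, y, mask, p, step):
    """One pass of SGD with iterate averaging over rows with missing entries.

    mask[k, j] is True where X[k, j] is observed; unobserved entries of X
    may hold any value (including NaN) because they are replaced by 0.0.
    """
    n, d = X.shape
    beta = np.zeros(d)
    beta_bar = np.zeros(d)
    for k in range(n):
        x_tilde = np.where(mask[k], X[k], 0.0)  # zero imputation
        beta = beta - step * debiased_gradient(x_tilde, y[k], beta, p)
        beta_bar += (beta - beta_bar) / (k + 1)  # running average of the iterates
    return beta_bar
```

A call such as `averaged_sgd_missing(X, y, mask, p, step=0.01)` returns the averaged iterate after one pass over the rows. The step size here is a placeholder; a suitable value depends on the scale of the features and on the observation probabilities, since small values of p[j] inflate the variance of the gradient estimate.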
Main file
AvSGD_NA.pdf (576.94 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-02483651, version 1 (21-02-2020)
hal-02483651, version 2 (04-06-2020)

Cite

Aude Sportisse, Claire Boyer, Aymeric Dieuleveut, Julie Josse. Debiasing Stochastic Gradient Descent to handle missing values. 2020. ⟨hal-02483651v1⟩