Journal Articles Computer Vision and Image Understanding Year : 2017

Harnessing noisy Web images for deep representation

Phong D. Vo, Alexandru Ginsca, Hervé Le Borgne, Adrian Popescu

Abstract

The ever-growing volume of Web images is probably the next important data source for scaling up deep neural networks, which have recently surpassed human performance on image classification tasks. Because deep networks are hungry for labelled data, they cannot directly exploit Web images, which are abundant and cheap but unannotated. There have been efforts to train neural networks, such as autoencoders, in unsupervised or semi-supervised settings. Nonetheless, these approaches underperform supervised methods, partly because their loss functions, for instance the Euclidean loss, fail to guide the network to learn discriminative features and to ignore unnecessary details. We instead train convolutional networks in a supervised setting but on weakly labelled data: large amounts of unannotated Web images downloaded from Flickr and Bing. Our experiments are conducted at several data scales, with different choices of network architecture, and with alternating data preprocessing techniques. The effectiveness of our approach is demonstrated by the good generalization of the learned representations to six new public datasets.
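The weak-labelling idea in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline, only an assumed simplification: each image retrieved for a query keyword inherits that keyword as its (noisy) label, and images returned by more than one query are discarded because their label would be ambiguous. The function name `build_weak_labels` and the example URLs are hypothetical.

```python
from collections import Counter

def build_weak_labels(query_results):
    """Assign each image URL the query keyword that retrieved it.

    query_results: dict mapping a query keyword to a list of image URLs
    (e.g. as returned by an image search engine). Returns a list of
    (url, label) pairs; URLs retrieved by more than one query are
    dropped as ambiguous.
    """
    # Count how many query lists each URL appears in.
    counts = Counter(url for urls in query_results.values() for url in urls)
    pairs = []
    for label, urls in query_results.items():
        for url in urls:
            if counts[url] == 1:  # keep only unambiguous weak labels
                pairs.append((url, label))
    return pairs

# Hypothetical search results: "img3.jpg" is returned for both queries.
results = {
    "cat": ["img1.jpg", "img2.jpg", "img3.jpg"],
    "dog": ["img3.jpg", "img4.jpg"],
}
pairs = build_weak_labels(results)
# "img3.jpg" is ambiguous and dropped; three unambiguous pairs remain.
```

The resulting (url, label) pairs could then feed a standard supervised training loop, which is the setting the paper argues for over unsupervised alternatives.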

Dates and versions

cea-01756775 , version 1 (03-04-2018)

Cite

Phong D. Vo, Alexandru Ginsca, Hervé Le Borgne, Adrian Popescu. Harnessing noisy Web images for deep representation. Computer Vision and Image Understanding, 2017, 164, pp.68 - 81. ⟨10.1016/j.cviu.2017.01.009⟩. ⟨cea-01756775⟩