Entropy and mutual information in models of deep neural networks

Marylou Gabrié; Andre Manoel; Clément Luneau; Jean Barbier; Nicolas Macris; Florent Krzakala; Lenka Zdeborová

doi:10.1088/1742-5468/ab3430

Article Dans Une Revue Journal of Statistical Mechanics: Theory and Experiment Année : 2019

Entropy and mutual information in models of deep neural networks

(1) , (2) , (3) , (4) , (3) , (1) , (5)

1
2
3
4
5

Marylou Gabrié

Fonction : Auteur correspondant
PersonId : 1078614

Connectez-vous pour contacter l'auteur

Laboratoire de Physique Statistique de l'ENS

Andre Manoel

Fonction : Auteur

Owkin, Inc. [New York, NY, États-Unis]

Clément Luneau

Fonction : Auteur

Ecole Polytechnique Fédérale de Lausanne

Jean Barbier

Fonction : Auteur

Abdus Salam International Centre for Theoretical Physics [Trieste]

Nicolas Macris

Fonction : Auteur

Ecole Polytechnique Fédérale de Lausanne

Florent Krzakala

Fonction : Auteur
PersonId : 1179607
ORCID : 0000-0003-2313-2578
IdRef : 070360715

Laboratoire de Physique Statistique de l'ENS

Lenka Zdeborová

Fonction : Auteur
PersonId : 1234977
ORCID : 0000-0002-8377-3978
IdRef : 128058153

Institut de Physique Théorique - UMR CNRS 3681

Résumé

We examine a class of stochastic deep learning models with a tractable method to compute information-theoretic quantities. Our contributions are three-fold: (i) We show how entropies and mutual informations can be derived from heuristic statistical physics methods, under the assumption that weight matrices are independent and orthogonally-invariant. (ii) We extend particular cases in which this result is known to be rigorously exact by providing a proof for two-layers networks with Gaussian random weights, using the recently introduced adaptive interpolation method. (iii) We propose an experiment framework with generative models of synthetic datasets, on which we train deep neural networks with a weight constraint designed so that the assumption in (i) is verified during learning. We study the behavior of entropies and mutual informations throughout learning and conclude that, in the proposed setting, the relationship between compression and generalization remains elusive.

Domaines

Physique [physics]

Fichier principal

publi.pdf (3.13 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Emmanuelle De Laborderie : Connectez-vous pour contacter le contributeur

https://cea.hal.science/cea-01930228

Soumis le : mercredi 21 novembre 2018-16:50:52

Dernière modification le : vendredi 19 avril 2024-16:18:59

Archivage à long terme le : vendredi 22 février 2019-15:47:49

Dates et versions

cea-01930228 , version 1 (21-11-2018)

Licence

Paternité

Identifiants

HAL Id : cea-01930228 , version 1
DOI : 10.1088/1742-5468/ab3430

Citer

Marylou Gabrié, Andre Manoel, Clément Luneau, Jean Barbier, Nicolas Macris, et al.. Entropy and mutual information in models of deep neural networks. Journal of Statistical Mechanics: Theory and Experiment, 2019, 19, pp.124014. ⟨10.1088/1742-5468/ab3430⟩. ⟨cea-01930228⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CEA UNIV-PARIS7 ENS-PARIS LPS CNRS DSM-IPHT CEA-UPSAY PSL USPC UNIV-PARIS-SACLAY CEA-DRF SORBONNE-UNIVERSITE SU-SCIENCES UP-SCIENCES ANR GS-MATHEMATIQUES GS-PHYSIQUE URP-LPS

119 Consultations

985 Téléchargements

Entropy and mutual information in models of deep neural networks

Résumé

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager