Structured Sparse Principal Components Analysis With the TV-Elastic Net Penalty

Abstract : Principal component analysis (PCA) is an exploratory tool widely used in data analysis to uncover dominant patterns of variability within a population. Despite its ability to represent a data set in a low-dimensional space, PCA's inter-pretability remains limited. Indeed, the components produced by PCA are often noisy or exhibit no visually meaningful patterns. Furthermore, the fact that the components are usually non-sparse may also impede interpretation, unless arbitrary thresholding is applied. However, in neuroimaging, it is essential to uncover clinically interpretable phenotypic markers that would account for the main variability in the brain images of a population. Recently, some alternatives to the standard PCA approach, such as Sparse PCA, have been proposed, their aim being to limit the density of the components. Nonetheless, sparsity alone does not entirely solve the interpretability problem in neuroimaging, since it may yield scattered and unstable components. We hypothesized that the incorporation of prior information regarding the structure of the data may lead to improved relevance and interpretability of brain patterns. We therefore present a simple extension of the popular PCA framework that adds structured sparsity penalties on the loading vectors in order to identify the few stable regions in the brain images that capture most of the variability. Such structured sparsity can be obtained by combining e.g., $l$1 and total variation (TV) penalties, where the TV regularization encodes information on the underlying structure of the data. This paper presents the structured sparse PCA (denoted SPCA-TV) optimization framework and its resolution. We demonstrate SPCA-TV's effectiveness and versatility on three different data sets. It can be applied to any kind of structured data, such as e.g., N-dimensional array images or meshes of cortical surfaces. The gains of SPCA-TV over unstructured approaches (such as Sparse PCA and ElasticNet PCA) or structured approach (such as GraphNet PCA) are significant, since SPCA-TV reveals the variability within a data set in the form of intelligible brain patterns that are easier to interpret and more stable across different samples.
Complete list of metadatas

https://hal-cea.archives-ouvertes.fr/cea-01883278
Contributor : Edouard Duchesnay <>
Submitted on : Thursday, September 27, 2018 - 10:44:26 PM
Last modification on : Friday, March 8, 2019 - 1:20:24 AM
Long-term archiving on : Friday, December 28, 2018 - 5:01:15 PM

File

dePierrefeu17-ieeetmi_pcatv_pr...
Publisher files allowed on an open archive

Identifiers

Citation

Amicie de Pierrefeu, Tommy Lofstedt, Fouad Hadj-Selem, Mathieu Dubois, Renaud Jardri, et al.. Structured Sparse Principal Components Analysis With the TV-Elastic Net Penalty. IEEE Transactions on Medical Imaging, Institute of Electrical and Electronics Engineers, 2018, 37 (2), pp.396 - 407. ⟨10.1109/tmi.2017.2749140⟩. ⟨cea-01883278⟩

Share

Metrics

Record views

238

Files downloads

96