Contrastive predictive coding for video representation learning - Archive ouverte HAL Access content directly
Conference Papers Year :

Contrastive predictive coding for video representation learning

(1) , (1) , (1) , (2) , (2)
1
2

Abstract

Contrastive Predictive Coding (CPC) (van den Oord et al., 2018) has been successfully used to learn representations for different signals (audio, text, images). It uses an autoregressive modeling and contrastive estimation to learn long-term temporal relation inside the raw signal while remaining robust to local noise. The result is a higher level signal representation useful to solve downstream tasks. Using CPC to learn representations for videos remains challenging due to the structure and the high dimensionality of the signal. In this work, we propose different implementations of CPC for video signal. The learned representation increases the performance of an action recognition classifier.
Fichier principal
Vignette du fichier
video_cpc.pdf (294.36 Ko) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

cea-03547497 , version 1 (28-01-2022)

Identifiers

  • HAL Id : cea-03547497 , version 1

Cite

Guillaume Lorre, Jaonary Rabarisoa, Astrid Orcesi, Samia Ainouz, Stéphane Canu. Contrastive predictive coding for video representation learning. ICML2019 - 36th International Conference on Machine Learning - Workshop on Self-Supervised Learning, Jun 2019, Long Beach, United States. ⟨cea-03547497⟩
119 View
127 Download

Share

Gmail Facebook Twitter LinkedIn More