Whitening for Self-Supervised Representation Learning
Ermolov, Aleksandr, Siarohin, Aliaksandr, Sangineto, Enver, Sebe, Nicu
Most of the self-supervised representation learning methods are based on the contrastive loss and the instance-discrimination task, where augmented versions of the same image instance ("positives") are contrasted with instances extracted from other images ("negatives"). For the learning to be effective, a lot of negatives should be compared with a positive pair, which is computationally demanding. In this paper, we propose a different direction and a new loss function for selfsupervised representation learning which is based on the whitening of the latentspace features. The whitening operation has a "scattering" effect on the batch samples, which compensates the use of negatives, avoiding degenerate solutions where all the sample representations collapse to a single point. Our Whitening MSE (W-MSE) loss does not require special heuristics (e.g. Since negatives are not needed, we can extract multiple positive pairs from the same image instance. We empirically show that W-MSE is competitive with respect to popular, more complex self-supervised methods.
Oct-4-2020
- Country:
- Europe > Italy
- Trentino-Alto Adige/Südtirol > Trentino Province > Trento (0.04)
- Asia > Myanmar
- Tanintharyi Region > Dawei (0.04)
- Europe > Italy
- Genre:
- Research Report (1.00)
- Technology:
- Information Technology
- Sensing and Signal Processing > Image Processing (1.00)
- Artificial Intelligence
- Representation & Reasoning (0.93)
- Natural Language (0.93)
- Vision (0.93)
- Machine Learning
- Statistical Learning (1.00)
- Neural Networks (1.00)
- Supervised Learning (0.68)
- Information Technology