Demystifying Contrastive Self-Supervised Learning: Invariances, Augmentations and Dataset Biases

Neural Information Processing Systems 

Self-supervised representation learning approaches have recently surpassed their supervised learning counterparts on downstream tasks like object detection and image classification. Somewhat mysteriously the recent gains in performance come from training instance classification models, treating each image and it's augmented versions as samples of a single class.