Unsupervised Object-Level Representation Learning from Scene Images

Neural Information Processing Systems 

Our key insight is to leverage image-level self-supervised pre-training as the prior to discover object-level semantic correspondence, thus realizing object-level representation learning from scene images.