A simple, efficient and scalable contrastive masked autoencoder for learning visual representations

Open in new window