Semi-supervised Vision Transformers at Scale

Neural Information Processing Systems 

We study semi-supervised learning (SSL) for vision transformers (ViT), an under-explored topic despite the wide adoption of ViT architectures across different tasks.