Intriguing Propertiesof Vision Transformers

Neural Information Processing Systems 

Inthecaseof classification (left), the ImageNetpre-trained ViTstransferbetterthantheir CNNcounterpartsacrosstasks.