Learning Imbalanced Data with Vision Transformers