Self-supervised Vision Transformer are Scalable Generative Models for Domain Generalization