Stochastic Vision Transformers with Wasserstein Distance-Aware Attention

Open in new window