Convolutional Embedding Makes Hierarchical Vision Transformer Stronger
