Dual PatchNorm
Kumar, Manoj, Dehghani, Mostafa, Houlsby, Neil
–arXiv.org Artificial Intelligence
We propose Dual PatchNorm: two Layer Normalization layers (LayerNorms), before and after the patch embedding layer in Vision Transformers. We demonstrate that Dual Patch-Norm outperforms the result of exhaustive search for alternative LayerNorm placement strategies in the Transformer block itself. In our experiments on image classification, contrastive learning, semantic segmentation and transfer on downstream classification datasets, incorporating this trivial modification, often leads to improved accuracy over well-tuned vanilla Vision Transformers and never hurts.
arXiv.org Artificial Intelligence
May-8-2023
- Genre:
- Research Report > New Finding (0.48)
- Technology:
- Information Technology > Artificial Intelligence
- Vision (1.00)
- Machine Learning (1.00)
- Representation & Reasoning > Search (0.34)
- Cognitive Science > Problem Solving (0.34)
- Information Technology > Artificial Intelligence