Long-Short Transformer: Efficient Transformers for Language and Vision (Appendix) A Details of Norm Comparisons
–Neural Information Processing Systems
The first design helps the model focus more on the global context of the image, since each patch can attend to every region of the image. This reduces the local texture bias of CNNs.
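To illustrate what "each patch can attend to every region" means, here is a minimal sketch of global dot-product attention over a handful of patch embeddings. It is not the paper's implementation: the function name `global_patch_attention`, the toy 3-dimensional embeddings, and the use of identity query/key/value projections are all assumptions made for illustration.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def global_patch_attention(patches):
    """Single-head scaled dot-product attention over all patches.

    Assumption: queries, keys, and values are the patch embeddings
    themselves (identity projections), to keep the sketch short.
    """
    d = len(patches[0])
    scale = 1.0 / math.sqrt(d)
    weights = []
    for q in patches:
        # Each query patch is scored against every patch in the image.
        scores = [scale * sum(a * b for a, b in zip(q, k)) for k in patches]
        weights.append(softmax(scores))
    # Each output is a weighted average of all patch embeddings.
    outputs = [
        [sum(w * v[j] for w, v in zip(row, patches)) for j in range(d)]
        for row in weights
    ]
    return outputs, weights


# Hypothetical 2x2 grid of patches, flattened, with toy 3-d embeddings.
patches = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
    [1.0, 1.0, 0.0],
]
outputs, weights = global_patch_attention(patches)
```

Every row of `weights` assigns a nonzero weight to every patch, so the receptive field of each output covers the whole image in a single layer, unlike a convolution whose kernel only sees a local neighborhood.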
Feb-9-2026, 23:59:19 GMT