QT-ViT: Improving Linear Attention in ViT with Quadratic Taylor Expansion

Neural Information Processing Systems 

In this paper, we propose QT -ViT models that improve the previous linear self-attention using quadratic Taylor expansion.