Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method

Open in new window