DCT-Former: Efficient Self-Attention with Discrete Cosine Transform
