On the Relationship between Self-Attention and Convolutional Layers

Open in new window