Pruning Self-attentions into Convolutional Layers in Single Path

Open in new window