Automatic Channel Pruning for Multi-Head Attention
