Boost Vision Transformer with GPU-Friendly Sparsity and Quantization

Open in new window