Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers
