On the Surprising Effectiveness of Attention Transfer for Vision Transformers

Open in new window