How Much Does Attention Actually Attend? Questioning the Importance of Attention in Pretrained Transformers

Open in new window