Benign Overfitting in Single-Head Attention
