Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis

Open in new window