High-Dimensional Analysis of Single-Layer Attention for Sparse-Token Classification
