How Does Attention Help? Insights from Random Matrices on Signal Recovery from Sequence Models
