How Classifier Features Transfer to Downstream: An Asymptotic Analysis in a Two-Layer Model
