Training Dynamics of In-Context Learning in Linear Attention

Open in new window