Infinite Limits of Multi-head Transformer Dynamics

Open in new window