Two-Scale Latent Dynamics for Recurrent-Depth Transformers
