White-Box Transformers via Sparse Rate Reduction

Open in new window