White-Box Transformers via Sparse Rate Reduction

Feb-8-2026, 16:16:29 GMT–Neural Information Processing Systems

In Section 2.2 we show, using an idealized model for the token distribution, that if one iteratively

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Feb-8-2026, 16:16:29 GMT

Conferences PDF

Country:
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (0.93)
  - Artificial Intelligence
    - Vision (1.00)
    - Representation & Reasoning (1.00)
    - Natural Language (0.93)
    - Machine Learning
      - Neural Networks > Deep Learning (1.00)
      - Statistical Learning (0.69)

Duplicate Docs Excel Report

Title
1e118ba9ee76c20df728b42a35fb4704-Supplemental-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found