Transformers are uninterpretable with myopic methods: a case study with bounded Dyck grammars

Oct-8-2025, 22:53:49 GMT–Neural Information Processing Systems

Transformer interpretability aims to understand the algorithm implemented by a learned Transformer by examining various aspects of the model, such as the weight matrices or the attention patterns.

artificial intelligence, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Oct-8-2025, 22:53:49 GMT

Conferences PDF

Add feedback

Country:
- North America > United States
  - Pennsylvania > Allegheny County
    - Pittsburgh (0.04)
  - Minnesota > Hennepin County
    - Minneapolis (0.14)
  - California > San Diego County
    - San Diego (0.04)
- Europe
  - Switzerland (0.04)
  - Italy > Tuscany
    - Florence (0.04)
  - France > Provence-Alpes-Côte d'Azur
    - Bouches-du-Rhône > Marseille (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia
  - China > Hong Kong (0.04)
  - Middle East > Jordan (0.04)

Genre:
- Research Report > New Finding (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Natural Language (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
79ba1b827d3fc58e129d1cbfc8ff69f2-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found