Interpretable Next-token Prediction via the Generalized Induction Head

Jun-21-2026, 12:51:36 GMT–Neural Information Processing Systems

While large transformer models excel in predictive performance, their lack of interpretability restricts their usefulness in high-stakes domains. To remedy this, we propose the Generalized Induction-Head Model (GIM), an interpretable model for next-token prediction inspired by the observation of "induction heads" in LLMs. GIM is a retrieval-based module that identifies similar sequences in the input context by combining exact n-gram matching and fuzzy matching based on a neural similarity metric. We evaluate GIM in two settings: language modeling and fMRI response prediction. In language modeling, GIM improves next-token prediction by up to 25%p over interpretable baselines, significantly narrowing the gap with black-box LLMs. In an fMRI setting, GIM improves neural response prediction by 20% and offers insight into the language selectivity of the brain. GIM represents a significant step toward uniting interpretability and performance across domains.

large language model, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Jun-21-2026, 12:51:36 GMT

Conferences PDF

Add feedback

Country:
- North America > United States (0.28)
- Asia > Middle East (0.28)

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Health & Medicine
  - Therapeutic Area > Neurology (1.00)
  - Health Care Technology (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.98)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found