Linking In-context Learning in Transformers to Human Episodic Memory

Neural Information Processing Systems, May-26-2025, 15:56:51 GMT

Understanding connections between artificial and biological intelligent systems can reveal fundamental principles of general intelligence. While many artificial intelligence models have a neuroscience counterpart, such connections are largely missing in Transformer models and the self-attention mechanism. Here, we examine the relationship between interacting attention heads and human episodic memory. We focus on induction heads, which contribute to in-context learning in Transformer-based large language models (LLMs). We demonstrate that induction heads are behaviorally, functionally, and mechanistically similar to the contextual maintenance and retrieval (CMR) model of human episodic memory.

artificial intelligence, large language model, natural language

Industry:
- Health & Medicine > Consumer Health (0.91)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Representation & Reasoning > Scripts & Frames (0.91)