Brain-Language Model Alignment: Insights into the Platonic Hypothesis and Intermediate-Layer Advantage

López-Cardona, Ángela, Idesis, Sebastián, Masias-Bruns, Mireia, Abadal, Sergi, Arapakis, Ioannis

Oct-22-2025, arXiv.org Artificial Intelligence

Do brains and language models converge toward the same internal representations of the world? Recent years have seen a rise in studies of neural activations and model alignment. In this work, we review 25 fMRI-based studies published between 2023 and 2025 and explicitly confront their findings with two key hypotheses: (i) the Platonic Representation Hypothesis -- that as models scale and improve, they converge to a representation of the real world, and (ii) the Intermediate-Layer Advantage -- that intermediate (mid-depth) layers often encode richer, more generalizable features. Our findings provide converging evidence that models and brains may share abstract representational structures, supporting both hypotheses and motivating further research on brain-model alignment.

large language model, machine learning, natural language


Country:
- Europe (1.00)
- Asia (0.92)
- North America > United States
  - Minnesota (0.28)

Genre:
- Research Report > New Finding (1.00)
- Overview (1.00)

Industry:
- Health & Medicine
  - Therapeutic Area > Neurology (1.00)
  - Health Care Technology (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Cognitive Science > Neuroscience (0.87)
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
