Polysemy of Synthetic Neurons Towards a New Type of Explanatory Categorical Vector Spaces
Pichat, Michael, Pogrund, William, Pichat, Paloma, Poumay, Judicael, Gasparian, Armanouche, Demarchi, Samuel, Corbet, Martin, Georgeon, Alois, Veillet-Guillem, Michael
–arXiv.org Artificial Intelligence
The polysemantic nature of synthetic neurons in artificial intelligence language models is currently understood as the result of a necessary superposition of distributed features within the latent space. We propose an alternative approach, geometrically defining a neuron in layer n as a categorical vector space with a non-orthogonal basis, composed of categorical sub-dimensions extracted from preceding neurons in layer n-1. This categorical vector space is structured by the activation space of each neuron and enables, via an intra-neuronal attention process, the identification and utilization of a critical categorical zone for the efficiency of the language model - more homogeneous and located at the intersection of these different categorical sub-dimensions.
arXiv.org Artificial Intelligence
May-14-2025
- Country:
- Asia
- Europe
- France > Auvergne-Rhône-Alpes
- Russia > Central Federal District
- Moscow Oblast > Moscow (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Oxfordshire > Oxford (0.04)
- North America > United States
- California (0.04)
- Illinois (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- New York (0.04)
- Genre:
- Overview (1.00)
- Research Report > New Finding (1.00)
- Industry:
- Health & Medicine > Therapeutic Area > Neurology (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science > Neuroscience (0.67)
- Machine Learning
- Neural Networks > Deep Learning (1.00)
- Statistical Learning (1.00)
- Natural Language
- Chatbot (0.93)
- Large Language Model (1.00)
- Text Processing (1.00)
- Representation & Reasoning > Ontologies (0.67)
- Vision (0.93)
- Information Technology > Artificial Intelligence