Polysemy of Synthetic Neurons Towards a New Type of Explanatory Categorical Vector Spaces
Pichat, Michael, Pogrund, William, Pichat, Paloma, Poumay, Judicael, Gasparian, Armanouche, Demarchi, Samuel, Corbet, Martin, Georgeon, Alois, Veillet-Guillem, Michael
–arXiv.org Artificial Intelligence
The polysemantic nature of synthetic neurons in artificial intelligence language models is currently understood as the result of a necessary superposition of distributed features within the latent space. We propose an alternative approach, geometrically defining a neuron in layer n as a categorical vector space with a non-orthogonal basis, composed of categorical sub-dimensions extracted from preceding neurons in layer n-1. This categorical vector space is structured by the activation space of each neuron and enables, via an intra-neuronal attention process, the identification and utilization of a critical categorical zone for the efficiency of the language model - more homogeneous and located at the intersection of these different categorical sub-dimensions.
arXiv.org Artificial Intelligence
May-14-2025
- Country:
- Europe (0.92)
- North America > United States (0.67)
- Genre:
- Overview (1.00)
- Research Report > New Finding (1.00)
- Industry:
- Health & Medicine > Therapeutic Area > Neurology (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science > Neuroscience (0.67)
- Machine Learning
- Neural Networks > Deep Learning (1.00)
- Statistical Learning (1.00)
- Supervised Learning > Representation Of Examples (0.82)
- Natural Language
- Chatbot (0.93)
- Large Language Model (1.00)
- Text Processing (1.00)
- Representation & Reasoning > Ontologies (0.67)
- Information Technology > Artificial Intelligence