Disentangling Superpositions: Interpretable Brain Encoding Model with Sparse Concept Atoms

Jun-13-2026, 04:59:36 GMT–Neural Information Processing Systems

Encoding models using word embeddings or artificial neural network (ANN) features reliably predict brain responses to naturalistic stimuli, yet interpreting these models remains challenging. A central limitation is superposition: distinct semantic features become entangled along correlated directions in dense embeddings when latent features outnumber embedding dimensions. This entanglement renders regression weights non-identifiable--different combinations of semantic directions can produce identical predictions, precluding principled interpretation of voxel selectivity. To address this, we introduce the Sparse Concept Encoding Model, which transforms dense embeddings into a higher-dimensional, sparse, non-negative space of learned concept atoms.

artificial intelligence, machine learning, proceedings, (5 more...)

Neural Information Processing Systems

Jun-13-2026, 04:59:36 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.60)