AITopics | superposition


Probing for Representation Manifolds in Superposition

Modell, Alexander

arXiv.org Machine Learning, May-19-2026

This paper introduces the Manifold Probe, a supervised method for discovering representation manifolds in superposition. The method generalizes linear regression probes by learning the space of features of a concept that can be linearly predicted from the representations, and then learning the directions used to encode them. We demonstrate the probe on representations of time and space in Llama 2-7b, finding manifolds which linearly represent an interpretable set of features in each case. In the case of time, we show that by steering along the manifold, we can influence the model's completions about the years in which famous songs, movies and books were released, providing evidence that the Manifold Probe can discover manifolds which are causally involved in model behaviour.
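For orientation, here is a minimal sketch of the baseline that the abstract says the Manifold Probe generalizes: a plain linear regression probe. It is not the Manifold Probe itself. The helper name `fit_linear_probe`, the synthetic data, and all dimensions are assumptions made for illustration; in practice `X` would hold model representations and `y` a scalar feature such as a year.

```python
import numpy as np

def fit_linear_probe(X, y):
    """Least-squares probe: find w, b minimizing ||X w + b - y||^2."""
    # Append a constant column so the bias is fitted jointly with the weights.
    X1 = np.hstack([X, np.ones((X.shape[0], 1))])
    sol, *_ = np.linalg.lstsq(X1, y, rcond=None)
    return sol[:-1], sol[-1]

# Synthetic stand-in for hidden representations (assumption: not real model data).
rng = np.random.default_rng(0)
n, d = 500, 16
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = X @ w_true + 3.0 + 0.01 * rng.normal(size=n)  # linearly encoded scalar feature

w, b = fit_linear_probe(X, y)
pred = X @ w + b
# Coefficient of determination: how much of the feature is linearly predictable.
r2 = 1.0 - np.sum((y - pred) ** 2) / np.sum((y - y.mean()) ** 2)
```

The fitted `w` is the single direction along which the feature is read out; the abstract describes the Manifold Probe as learning a space of such linearly predictable features rather than one fixed target.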

large language model, machine learning, natural language, (17 more...)

arXiv.org Machine Learning

2605.18537

Country: North America > United States (0.14)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.66)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.50)

A Resource scaling for quantum backpropagation methods

Neural Information Processing Systems, Feb-15-2026, 18:55:21 GMT

Both of the techniques assume bitwise access to the oracle as a classical function.

artificial intelligence, gradient, machine learning, (19 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Jordan (0.05)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Backpropagation (0.43)

MIMONets: Multiple-Input-Multiple-Output Neural Networks Exploiting Computation in Superposition

Neural Information Processing Systems, Feb-15-2026, 10:41:55 GMT

Finally, we provide mathematical bounds on the interference between superposition channels in MIMOFormer.

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country: Europe > Switzerland > Zürich > Zürich (0.05)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Tracr: Compiled Transformers as a Laboratory for Interpretability

David Lindner

Neural Information Processing Systems, Feb-14-2026, 22:18:19 GMT

We show how to "compile" human-readable programs into standard decoder-only transformer models.

large language model, machine learning, selector, (22 more...)

Neural Information Processing Systems

Genre:

Overview (0.68)
Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Structured Variational Inference in Continuous Cox Process Models

Virginia Aglietti, Edwin V. Bonilla, Theodoros Damoulas, Sally Cripps

Neural Information Processing Systems, Feb-12-2026, 11:16:57 GMT

This view enables a structured variational approximation capturing dependencies across variables in the model.

artificial intelligence, intensity, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Asia > Middle East > Jordan (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.46)

Superposition of many models into one

Brian Cheung, Alexander Terekhov, Yubei Chen, Pulkit Agrawal, Bruno Olshausen

Neural Information Processing Systems, Feb-12-2026, 02:53:10 GMT

Neural Information Processing Systems http://nips.cc/

arxiv preprint arxiv, neural network, superposition, (13 more...)

Neural Information Processing Systems

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Industry: Education > Educational Setting > Online (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

c88d0c9bea6230b518ce71268c8e49e0-Paper-Conference.pdf

Neural Information Processing Systems, Feb-11-2026, 21:30:50 GMT

This paper presents a text classification algorithm inspired by the notion of superposition of states in quantum physics.

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Switzerland > Neuchâtel > Neuchâtel (0.04)
(2 more...)

Genre: Research Report (0.68)

Industry:

Leisure & Entertainment (0.47)
Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

78f7d96ea21ccae89a7b581295f34135-Paper.pdf

Neural Information Processing Systems, Feb-9-2026, 01:13:55 GMT

The rates for representing the class of functions GD via DReLU layers are sharp up to constants, as shown by matching lower bounds.

artificial intelligence, arxiv preprint arxiv, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Africa > South Sudan > Bahr el Ghazal > Warrap State > Tonj (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)

Superposition unifies power-law training dynamics

Chen, Zixin Jessie, Chen, Hao, Liu, Yizhou, Gore, Jeff

arXiv.org Machine Learning, Feb-3-2026

We investigate the role of feature superposition in the emergence of power-law training dynamics using a teacher-student framework. We first derive an analytic theory for training without superposition, establishing that the power-law training exponent depends on both the input data statistics and channel importance. Remarkably, we discover that a superposition bottleneck induces a transition to a universal power-law exponent of $\sim 1$, independent of data and channel statistics. This one-over-time training with superposition represents an up to tenfold acceleration compared to the purely sequential learning that takes place in the absence of superposition. Our finding that superposition leads to rapid training with a data-independent power-law exponent may have important implications for a wide range of neural networks that employ superposition, including production-scale large language models.

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Machine Learning

2602.01045

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
North America > United States > Illinois (0.04)

Genre: Research Report > New Finding (0.66)

Industry: Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Spectral Superposition: A Theory of Feature Geometry

Ivanov, Georgi, Oozeer, Narmeen, Raval, Shivam, Pejovic, Tasana, Upadhyay, Shriyash, Abdullah, Amir

arXiv.org Machine Learning, Feb-3-2026

Neural networks represent more features than they have dimensions via superposition, forcing features to share representational space. Current methods decompose activations into sparse linear features but discard geometric structure. We develop a theory for studying the geometric structure of features by analyzing the spectra (eigenvalues, eigenspaces, etc.) of weight-derived matrices. In particular, we introduce the frame operator $F = WW^\top$, which gives us a spectral measure that describes how each feature allocates norm across eigenspaces. While previous tools could describe the pairwise interactions between features, spectral methods capture the global geometry ("how do all features interact?"). In toy models of superposition, we use this theory to prove that capacity saturation forces spectral localization: features collapse onto single eigenspaces, organize into tight frames, and admit discrete classification via association schemes, classifying all geometries from prior work (simplices, polygons, antiprisms). The spectral measure formalism applies to arbitrary weight matrices, enabling diagnosis of feature localization beyond toy settings. These results point toward a broader program: applying operator theory to interpretability.
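As a rough illustration of the frame operator $F = WW^\top$ mentioned in the abstract, the sketch below computes it for a small weight matrix and measures how much of each feature's squared norm falls in each eigenspace of $F$. Several things here are assumptions, not taken from the paper: that the columns of `W` are the feature vectors, that the spectral measure of feature $w_i$ places mass $\lVert P_k w_i \rVert^2$ on the $k$-th eigenspace, and the helper names `frame_operator` and `spectral_mass`. The example uses three unit vectors at 120 degrees in the plane, a standard tight frame.

```python
import numpy as np

def frame_operator(W):
    """F = W W^T for a (d, n) matrix whose columns are feature vectors."""
    return W @ W.T

def spectral_mass(W, tol=1e-8):
    """Return (distinct eigenvalues, mass), where mass[i, k] is the squared
    norm of feature i projected onto the k-th eigenspace of F (assumed
    definition of the spectral measure)."""
    F = frame_operator(W)
    evals, evecs = np.linalg.eigh(F)   # ascending eigenvalues, orthonormal eigenvectors
    coeffs = (evecs.T @ W) ** 2        # (d, n): squared coefficient of feature i on eigenvector j
    distinct, groups = [], []
    for j, lam in enumerate(evals):
        # Merge numerically equal eigenvalues into a single eigenspace.
        if distinct and abs(lam - distinct[-1]) <= tol:
            groups[-1].append(j)
        else:
            distinct.append(lam)
            groups.append([j])
    mass = np.stack([coeffs[g].sum(axis=0) for g in groups], axis=1)  # (n, K)
    return np.array(distinct), mass

# Three unit vectors at 120 degrees in R^2: three features sharing two dimensions.
angles = 2.0 * np.pi * np.arange(3) / 3.0
W = np.stack([np.cos(angles), np.sin(angles)])  # shape (2, 3)

F = frame_operator(W)
eigenvalues, mass = spectral_mass(W)
```

For this frame, `F` is a multiple of the identity, so there is a single eigenspace and every feature places all of its norm there. For a general `W`, each row of `mass` sums to the squared norm of the corresponding feature, since the eigenvectors of `F` form an orthonormal basis.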

artificial intelligence, machine learning, matrix, (17 more...)

arXiv.org Machine Learning

2602.02224

Country: