AITopics | deep neural collapse

a60c43ba078b723d3d517d28c50ded4c-Paper-Conference.pdf

Neural Information Processing Systems, Feb-16-2026, 08:37:55 GMT

artificial intelligence, machine learning, neural collapse, (16 more...)

Neural Information Processing Systems

Country: Europe > Austria (0.04)

Genre: Research Report (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Deep Neural Collapse Is Provably Optimal for the Deep Unconstrained Features Model

Neural Information Processing Systems, Dec-26-2025, 12:13:32 GMT

Neural collapse (NC) refers to the surprising structure of the last layer of deep neural networks in the terminal phase of gradient descent training. Recently, a growing body of experimental evidence has pointed to the propagation of NC to earlier layers of neural networks. However, while NC in the last layer is well studied theoretically, much less is known about its multi-layered counterpart, deep neural collapse (DNC). In particular, existing work focuses either on linear layers or only on the last two layers at the price of an extra assumption. Our work fills this gap by generalizing the established analytical framework for NC, the unconstrained features model, to multiple non-linear layers. Our key technical contribution is to show that, in a deep unconstrained features model, the unique global optimum for binary classification exhibits all the properties typical of DNC. This explains the existing experimental evidence of DNC. We also empirically show that (i) by optimizing deep unconstrained features models via gradient descent, the resulting solution agrees well with our theory, and (ii) trained networks recover the unconstrained features suitable for the occurrence of DNC, thus supporting the validity of this modeling principle.
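
The abstract above refers to properties typical of DNC without listing them. One property commonly reported in the neural collapse literature is within-class variability collapse: features of samples from the same class concentrate around their class mean. The following is a minimal sketch of how that could be measured for the features of a single layer. The function names and the trace-ratio proxy (within-class scatter divided by between-class scatter) are illustrative assumptions, not the metric used in the paper.

```python
from statistics import fmean


def class_means(features, labels):
    """Return {label: mean feature vector} for a list of feature vectors."""
    by_class = {}
    for x, y in zip(features, labels):
        by_class.setdefault(y, []).append(x)
    return {y: [fmean(col) for col in zip(*xs)] for y, xs in by_class.items()}


def sq_dist(u, v):
    """Squared Euclidean distance between two vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def variability_ratio(features, labels):
    """Within-class scatter divided by between-class scatter (trace form).

    Hypothetical proxy: values near 0 mean every sample sits close to its
    class mean relative to how far the class means are from the global mean.
    Undefined (division by zero) when all class means coincide.
    """
    means = class_means(features, labels)
    global_mean = [fmean(col) for col in zip(*features)]
    within = fmean([sq_dist(x, means[y]) for x, y in zip(features, labels)])
    between = fmean([sq_dist(means[y], global_mean) for y in labels])
    return within / between


# Toy binary-classification features for one layer (made-up numbers).
labels = [0, 0, 1, 1]
collapsed = [[1.0, 0.0], [1.0, 0.0], [-1.0, 0.0], [-1.0, 0.0]]
spread = [[1.5, 0.0], [0.5, 0.0], [-0.5, 0.0], [-1.5, 0.0]]

print(variability_ratio(collapsed, labels))  # 0.0
print(variability_ratio(spread, labels))     # 0.25
```

Applying such a ratio to the features of each layer in turn would give a rough picture of how far collapse has propagated toward earlier layers.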

deep neural collapse, deep unconstrained feature model, provably optimal, (5 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.83)

Average gradient outer product as a mechanism for deep neural collapse

Neural Information Processing Systems, May-27-2025, 20:34:13 GMT

Deep Neural Collapse (DNC) refers to the surprisingly rigid structure of the data representations in the final layers of Deep Neural Networks (DNNs). Though the phenomenon has been measured in a variety of settings, its emergence is typically explained via data-agnostic approaches, such as the unconstrained features model. In this work, we introduce a data-dependent setting where DNC forms due to feature learning through the average gradient outer product (AGOP). The AGOP is defined with respect to a learned predictor and is equal to the uncentered covariance matrix of its input-output gradients averaged over the training dataset. Deep Recursive Feature Machines construct a neural network by iteratively mapping the data with the AGOP and applying an untrained random feature map.
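
The AGOP definition above can be made concrete with a small sketch. It assumes a scalar-valued predictor `f` (a vector-valued predictor would use the Jacobian instead of a single gradient) and approximates gradients by central finite differences. The helper names and the toy predictor are hypothetical and not taken from the paper.

```python
def numerical_gradient(f, x, eps=1e-6):
    """Central-difference gradient of a scalar function f at the point x."""
    grad = []
    for i in range(len(x)):
        up = list(x)
        up[i] += eps
        down = list(x)
        down[i] -= eps
        grad.append((f(up) - f(down)) / (2 * eps))
    return grad


def agop(f, data):
    """Average over the data of the outer product grad f(x) grad f(x)^T.

    This is the uncentered covariance of the input gradients of f,
    returned as a d x d list of lists.
    """
    d = len(data[0])
    n = len(data)
    M = [[0.0] * d for _ in range(d)]
    for x in data:
        g = numerical_gradient(f, x)
        for i in range(d):
            for j in range(d):
                M[i][j] += g[i] * g[j] / n
    return M


def toy_predictor(x):
    """Toy predictor (assumption): depends only on the first coordinate."""
    return x[0] ** 2


train = [[1.0, 5.0], [2.0, -1.0]]
M = agop(toy_predictor, train)
print(M)
```

For this toy predictor the gradients at the two training points are (2, 0) and (4, 0), so the top-left entry of the matrix is approximately (4 + 16) / 2 = 10 and the remaining entries are 0. The AGOP places its weight on the input directions the predictor is sensitive to.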

average gradient outer product, deep neural collapse, gradient outer product, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.90)

Deep Neural Collapse Is Provably Optimal for the Deep Unconstrained Features Model

Peter Súkeník, Marco Mondelli, Christoph Lampert

arXiv.org Artificial Intelligence, May-22-2023

Neural collapse (NC) refers to the surprising structure of the last layer of deep neural networks in the terminal phase of gradient descent training. Recently, a growing body of experimental evidence has pointed to the propagation of NC to earlier layers of neural networks. However, while NC in the last layer is well studied theoretically, much less is known about its multi-layered counterpart, deep neural collapse (DNC). In particular, existing work focuses either on linear layers or only on the last two layers at the price of an extra assumption. Our paper fills this gap by generalizing the established analytical framework for NC, the unconstrained features model, to multiple non-linear layers. Our key technical contribution is to show that, in a deep unconstrained features model, the unique global optimum for binary classification exhibits all the properties typical of DNC. This explains the existing experimental evidence of DNC. We also empirically show that (i) by optimizing deep unconstrained features models via gradient descent, the resulting solution agrees well with our theory, and (ii) trained networks recover the unconstrained features suitable for the occurrence of DNC, thus supporting the validity of this modeling principle.

artificial intelligence, machine learning, neural collapse, (16 more...)

arXiv.org Artificial Intelligence

2305.13165

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)
