AITopics | overparameterization

Collaborating Authors

overparameterization

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Escape dynamics and implicit bias of one-pass SGD in overparameterized quadratic networks

Bocchi, Dario, Regimbeau, Theotime, Lucibello, Carlo, Saglietti, Luca, Cammarota, Chiara

arXiv.org Machine LearningApr-6-2026

We analyze the one-pass stochastic gradient descent dynamics of a two-layer neural network with quadratic activations in a teacher--student framework. In the high-dimensional regime, where the input dimension $N$ and the number of samples $M$ diverge at fixed ratio $α= M/N$, and for finite hidden widths $(p,p^*)$ of the student and teacher, respectively, we study the low-dimensional ordinary differential equations that govern the evolution of the student--teacher and student--student overlap matrices. We show that overparameterization ($p>p^*$) only modestly accelerates escape from a plateau of poor generalization by modifying the prefactor of the exponential decay of the loss. We then examine how unconstrained weight norms introduce a continuous rotational symmetry that results in a nontrivial manifold of zero-loss solutions for $p>1$. From this manifold the dynamics consistently selects the closest solution to the random initialization, as enforced by a conserved quantity in the ODEs governing the evolution of the overlaps. Finally, a Hessian analysis of the population-loss landscape confirms that the plateau and the solution manifold correspond to saddles with at least one negative eigenvalue and to marginal minima in the population-loss geometry, respectively.

artificial intelligence, machine learning, matrix, (18 more...)

arXiv.org Machine Learning

2604.03068

Country:

Europe > Italy > Lombardy > Milan (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Italy > Lazio > Rome (0.04)

Genre: Research Report (0.82)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.55)

Add feedback

Semi-flat minima and saddle points by embedding neural networks to overparameterization

Kenji Fukumizu, Shoichiro Yamaguchi, Yoh-ichi Mototake, Mirai Tanaka

Neural Information Processing SystemsFeb-13-2026, 09:20:32 GMT

We theoretically study the landscape of the training error for neural networks in overparameterized cases.

artificial intelligence, machine learning, training error, (16 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
North America > United States (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

fb8fe6b79288f3d83696a5d276f4fc9d-Paper-Conference.pdf

Neural Information Processing SystemsFeb-13-2026, 01:35:12 GMT

generalization, international conference, neural network, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > New York (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > China > Hong Kong > Kowloon (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.98)

Add feedback

f115f619b62833aadc5acb058975b0e6-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-12-2026, 19:42:09 GMT

eigenvalue, graph, matrix, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > New Jersey > Mercer County > Princeton (0.04)
North America > Canada > British Columbia > Vancouver (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
(3 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.41)

Add feedback

f115f619b62833aadc5acb058975b0e6-Paper-Conference.pdf

Neural Information Processing SystemsFeb-12-2026, 19:42:05 GMT

eigenvalue, learning, matrix, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > New Jersey > Mercer County > Princeton (0.04)
North America > Canada > British Columbia > Vancouver (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
(3 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.41)

Add feedback

47908cab4e5b696d7af5c7de69f3b7d2-Paper-Conference.pdf

Neural Information Processing SystemsFeb-12-2026, 07:58:38 GMT

artificial intelligence, log 2 2, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Europe > France (0.14)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
North America > Canada > British Columbia > Vancouver (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Genre:

Contests & Prizes (1.00)
Research Report > Experimental Study (0.93)
Workflow (0.69)

Industry: Leisure & Entertainment (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

TowardsSample-efficientOverparameterized Meta-learning

Neural Information Processing SystemsFeb-11-2026, 18:47:24 GMT

An overarching goal in machine learning is to build ageneralizable model with fewsamples.

artificial intelligence, machine learning, representation, (16 more...)

Neural Information Processing Systems

Country: Oceania > Australia > New South Wales > Sydney (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

TowardsSample-efficientOverparameterized Meta-learning

Neural Information Processing SystemsFeb-11-2026, 18:47:20 GMT

We then integrate these findings toobtain anoverallperformance guarantee forourmetalearning algorithm.

artificial intelligence, machine learning, representation, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

Add feedback

c82836ed448c41094025b4a872c5341e-Paper.pdf

Neural Information Processing SystemsFeb-11-2026, 03:23:25 GMT

Recently there has been significant theoretical progress on understanding the convergence andgeneralization ofgradient-based methods onnonconvexlosses withoverparameterized models. Nevertheless, manyaspectsofoptimization and generalization and in particular the critical role of small random initialization are not fully understood.

artificial intelligence, initialization, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > New York > New York County > New York City (0.05)
Asia > Middle East > Jordan (0.05)
North America > United States > Maryland > Baltimore (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Ingolstadt (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.52)

Add feedback

Filters

Collaborating Authors

overparameterization

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

1165af8b913fb836c6280b42d6e0084f-Supplemental-Conference.pdf

Escape dynamics and implicit bias of one-pass SGD in overparameterized quadratic networks

Semi-flat minima and saddle points by embedding neural networks to overparameterization

fb8fe6b79288f3d83696a5d276f4fc9d-Paper-Conference.pdf

f115f619b62833aadc5acb058975b0e6-Supplemental-Conference.pdf

f115f619b62833aadc5acb058975b0e6-Paper-Conference.pdf

47908cab4e5b696d7af5c7de69f3b7d2-Paper-Conference.pdf

TowardsSample-efficientOverparameterized Meta-learning

TowardsSample-efficientOverparameterized Meta-learning

c82836ed448c41094025b4a872c5341e-Paper.pdf