natural gradient
Exact natural gradient in deep linear networks and its application to the nonlinear case
Stochastic gradient descent (SGD) remains the method of choice for deep learning, despite the limitations that arise with ill-behaved objective functions. In cases where it can be estimated, the natural gradient has proven very effective at mitigating the catastrophic effects of pathological curvature in the objective function, but little is known theoretically about its convergence properties, and it has yet to find a practical implementation that scales to very deep and large networks. Here, we derive an exact expression for the natural gradient in deep linear networks, which exhibit pathological curvature similar to the nonlinear case. We provide, for the first time, an analytical solution for its convergence rate, showing that the loss decreases exponentially to the global minimum in parameter space. Our expression for the natural gradient is surprisingly simple, computationally tractable, and explains why some previously proposed approximations work well in practice. This opens new avenues for approximating the natural gradient in the nonlinear case, and we show in preliminary experiments that our online natural gradient descent outperforms SGD on MNIST autoencoding while sharing its computational simplicity.
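The paper's exact closed-form expression is not reproduced here, but the generic natural-gradient update it builds on is easy to sketch. Below is a minimal, hypothetical NumPy example of one natural-gradient step on a toy two-layer deep linear network, using the empirical Fisher (average outer product of per-sample gradients) with damping; in the paper's setting, a closed-form expression would replace the brute-force linear solve.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy two-layer deep linear network: y = W2 @ W1 @ x, squared loss.
d_in, d_hid, d_out, n = 5, 4, 3, 256
W1 = rng.normal(scale=0.3, size=(d_hid, d_in))
W2 = rng.normal(scale=0.3, size=(d_out, d_hid))
X = rng.normal(size=(n, d_in))
T = rng.normal(size=(n, d_out))           # regression targets

def per_sample_grads(W1, W2, X, T):
    """Flattened per-sample gradients of 0.5 * ||W2 W1 x - t||^2."""
    H = X @ W1.T                          # hidden activations, (n, d_hid)
    E = H @ W2.T - T                      # residuals, (n, d_out)
    gW2 = E[:, :, None] * H[:, None, :]   # dL/dW2 = e h^T, (n, d_out, d_hid)
    EW2 = E @ W2                          # W2^T e per sample, (n, d_hid)
    gW1 = EW2[:, :, None] * X[:, None, :] # dL/dW1 = (W2^T e) x^T, (n, d_hid, d_in)
    return np.concatenate([gW1.reshape(n, -1), gW2.reshape(n, -1)], axis=1)

G = per_sample_grads(W1, W2, X, T)        # (n, p)
grad = G.mean(axis=0)                     # average gradient
F = (G.T @ G) / n                         # empirical Fisher, (p, p)
damping = 1e-3                            # regularizes the solve
nat_grad = np.linalg.solve(F + damping * np.eye(F.shape[0]), grad)

lr = 0.1                                  # natural-gradient step
W1 -= lr * nat_grad[: W1.size].reshape(W1.shape)
W2 -= lr * nat_grad[W1.size :].reshape(W2.shape)
```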
On the Variance of the Fisher Information for Deep Learning
In deep learning, the Fisher information matrix (FIM) gives novel insights and useful tools to characterize the loss landscape, perform second-order optimization, and build geometric learning theories. The exact FIM is either unavailable in closed form or too expensive to compute, so in practice it is almost always estimated from empirical samples. We investigate two such estimators, based on two equivalent representations of the FIM, both unbiased and consistent. Their estimation quality is naturally gauged by their variance, which we give in closed form. We analyze how the parametric structure of a deep neural network can affect the variance. The meaning of this variance measure and its upper bounds are then discussed in the context of deep learning.
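As a toy illustration of the two equivalent representations (the covariance of the score versus the negative expected Hessian of the log-likelihood), consider estimating the Fisher information of the mean of a unit-variance Gaussian from samples. This one-parameter sketch is ours, not the paper's, but it shows how two unbiased, consistent estimators of the same FIM can have very different variance:

```python
import numpy as np

rng = np.random.default_rng(0)
theta = 1.5          # true mean of N(theta, 1); the Fisher information is exactly 1
n = 1000
x = rng.normal(loc=theta, scale=1.0, size=n)

# Estimator 1: average squared score,  F = E[(d/dtheta log p)^2]
score = x - theta                 # d/dtheta log N(x; theta, 1)
F_hat1 = np.mean(score ** 2)

# Estimator 2: negative average Hessian,  F = -E[d^2/dtheta^2 log p]
hessian = -np.ones_like(x)        # d^2/dtheta^2 log N(x; theta, 1) = -1 always
F_hat2 = -np.mean(hessian)

# Both are ~1, but F_hat1 fluctuates across samples while F_hat2 is
# exact here: same expectation, different variance.
print(F_hat1, F_hat2)
```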
ONG: Orthogonal Natural Gradient Descent
Yajat Yadav, Patrick Mendoza, Jathin Korrapati
Orthogonal Gradient Descent (OGD) has emerged as a powerful method for continual learning. However, its Euclidean projections do not exploit the information-geometric structure of the problem, which can lead to suboptimal convergence. To address this, we propose incorporating the natural gradient into OGD and present ONG (Orthogonal Natural Gradient Descent). ONG preconditions each new task-specific gradient with an efficient EKFAC approximation of the inverse Fisher information matrix, yielding updates that follow the steepest-descent direction under a Riemannian metric. To preserve performance on previously learned tasks, ONG projects these natural gradients onto the orthogonal complement of prior tasks' natural gradients. We provide an initial theoretical justification for this procedure and present preliminary results on the Permuted and Rotated MNIST benchmarks. These results indicate that a naive combination of natural gradients and orthogonal projections can be problematic, motivating future work on robustly reconciling the two geometric perspectives into a continual learning method, establishing a more rigorous theoretical foundation with formal convergence guarantees, and extending empirical validation to large-scale continual learning benchmarks.
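The following sketch is our illustration, not the authors' code, but it shows the shape of an ONG-style update: precondition the task gradient with an approximate inverse Fisher, then project the result away from the span of stored natural gradients from earlier tasks. A dense inverse stands in for the EKFAC approximation used in the paper, and the function and variable names are hypothetical:

```python
import numpy as np

def ong_step(grad, fisher_inv, prior_nat_grads, lr=0.1):
    """One ONG-style update (illustrative sketch).

    grad            -- flattened gradient for the current task, shape (p,)
    fisher_inv      -- approximate inverse Fisher, shape (p, p); the paper
                       uses an EKFAC approximation, not a dense matrix
    prior_nat_grads -- natural gradients stored from earlier tasks
    """
    nat_grad = fisher_inv @ grad          # steepest descent under the Fisher metric
    if prior_nat_grads:
        # Project onto the orthogonal complement of the span of prior
        # tasks' natural gradients (least-squares removal of that span).
        B = np.stack(prior_nat_grads, axis=1)             # (p, k)
        coeffs, *_ = np.linalg.lstsq(B, nat_grad, rcond=None)
        nat_grad = nat_grad - B @ coeffs
    return -lr * nat_grad                 # parameter increment

# Hypothetical usage on a 10-parameter problem:
rng = np.random.default_rng(0)
p = 10
fisher_inv = np.eye(p)                    # identity metric, for illustration only
prior = [rng.normal(size=p) for _ in range(2)]
delta = ong_step(rng.normal(size=p), fisher_inv, prior)
```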