AITopics | newton method

Partial Correlation Network Estimation by Semismooth Newton Methods

Neural Information Processing SystemsJun-23-2026, 02:11:26 GMT

We develop a scalable second-order algorithm for a recently proposed ℓ1regularized pseudolikelihood-based partial correlation network estimation framework. While the latter method admits statistical guarantees and is inherently scalable compared to likelihood-based methods such as graphical lasso, the currently available implementations rely only on first-order information and require thousands of iterations to obtain reliable estimates even on high-performance supercomputers. In this paper, we further investigate the inherent scalability of the framework and propose locally and globally convergent semismooth Newton methods. Despite the nonsmoothness of the problem, these second-order algorithms converge at a locally quadratic rate, and require only a few tens of iterations in practice. Each iteration reduces to solving linear systems of small dimensions or linear complementary problems of smaller dimensions, making the computation also suitable for less powerful computing environments. Experiments on both simulated and real-world genomic datasets demonstrate the superior convergence behavior and computational efficiency of the proposed algorithm, which position our method as a promising tool for massive-scale network analysis sought for in, e.g., modern multi-omics research.

artificial intelligence, iteration, machine learning, (18 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (0.46)
Health & Medicine > Pharmaceuticals & Biotechnology (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

Add feedback

Convergence Analysis of Newton's Method for Neural Networks in the Overparameterized Limit

Riedl, Konstantin, Spiliopoulos, Konstantinos, Sirignano, Justin

arXiv.org Machine LearningMay-21-2026

A convergence analysis is developed for the regularized Newton method for training neural networks (NNs) in the overparameterized limit. As the number of hidden units tends to infinity, the NN training dynamics converge in probability to the solution of a deterministic limit equation involving a ``Newton neural tangent kernel'' (NNTK). Explicit rates characterizing this convergence are provided and, in the infinite-width limit, we prove that the NN converges exponentially fast to the target data (i.e., a global minimizer with zero loss). We show that this convergence is uniform across the frequency spectrum, addressing the spectral bias inherent in gradient descent. The eigenvalues of the NTK for gradient descent accumulate at zero, leading to slow convergence for target data with high-frequency components. In contrast, the NNTK has uniformly lower bounded eigenvalues if the regularization parameter is selected appropriately, allowing Newton's method to converge more quickly for data with high-frequency components. Mathematical challenges that need to be addressed in our analysis include the implicit parameter update of the Newton method with a potentially indefinite Hessian matrix and the fact that the dimension of this linear system of equations tends to infinity as the NN width grows. This complicates deriving the training dynamics in the overparameterized limit as well as proving the convergence of the finite-width dynamics thereto. The analysis identifies a scaling formula for selecting the regularization parameter, which we show can vanish at a suitable rate as the number of hidden units becomes larger. We prove that, for sufficiently large numbers of hidden units, the regularized Hessian remains positive definite during training and the Newton updates for individual NN parameters converge to zero, showing that the model behaves as a linearization around the initialization.

artificial intelligence, convergence, machine learning, (18 more...)

arXiv.org Machine Learning

2605.08352

Country:

North America > United States (1.00)
Europe (1.00)
North America > Canada (0.68)

Genre: Workflow (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

2c19666cbb2c14d45d39e2dcf6ab0b99-Paper-Conference.pdf

Neural Information Processing SystemsApr-25-2026, 06:16:08 GMT

artificial intelligence, iteration, machine learning, (17 more...)

Neural Information Processing Systems

Genre: Research Report (0.68)

Industry: Energy (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.94)
Information Technology > Software (0.68)

Add feedback

00296c0e10cd24d415c2db63ea2a2c68-Paper-Conference.pdf

Neural Information Processing SystemsApr-24-2026, 04:41:45 GMT

algorithm, artificial intelligence, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.72)

Add feedback

Sub-sampled Newton Methods with Non-uniform Sampling

Peng Xu, Jiyan Yang, Fred Roosta, Christopher Ré, Michael W. Mahoney

Neural Information Processing SystemsMar-23-2026, 09:54:12 GMT

We consider the regime where nd. We propose randomized Newton-type algorithms that exploit non-uniform sub-sampling of { 2fi(w)}ni=1, as well as inexact updates, as means to reduce the computational complexity, and are applicable to a wide range of problems in machine learning. Two non-uniform sampling distributions based on block norm squares and block partial leverage scores are considered. Under certain assumptions, we show that our algorithms inherit a linear-quadratic convergence rate in w and achieve a lower computational complexity compared to similar existing methods. In addition, we show that our algorithms exhibit more robustness and better dependence on problem specific quantities, such as the condition number. We empirically demonstrate that our methods are at least twice as fast as Newton's methods on several real datasets.

artificial intelligence, leverage score, machine learning, (19 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.51)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Add feedback

bc6dc48b743dc5d013b1abaebd2faed2-Paper.pdf

Neural Information Processing SystemsMar-13-2026, 06:55:04 GMT

algorithm 1, matrix, newton method, (14 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Saudi Arabia (0.04)
North America > United States > Connecticut > New Haven County > New Haven (0.04)
North America > Canada (0.04)
(5 more...)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.35)

Add feedback

2c19666cbb2c14d45d39e2dcf6ab0b99-Paper-Conference.pdf

Neural Information Processing SystemsFeb-19-2026, 00:14:08 GMT

algorithm, iteration, trust-region method, (15 more...)

Neural Information Processing Systems

Country: North America > United States > Colorado (0.04)

Genre: Research Report (0.68)

Industry: Energy (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.94)
Information Technology > Software (0.68)

Add feedback

Safe and Sparse Newton Method for Entropic-Regularized Optimal Transport

Neural Information Processing SystemsFeb-18-2026, 13:57:47 GMT

More recently, Newton-type methods using sparsified Hessian matrices have demonstrated promising results on OT computation, but there still remain a lot of unresolved open questions.

algorithm, artificial intelligence, machine learning, (20 more...)

Neural Information Processing Systems

Country:

Asia > China > Shanghai > Shanghai (0.04)
Europe > Switzerland (0.04)
Asia > Middle East > Jordan (0.04)
Asia > Japan > Honshū > Tōhoku > Fukushima Prefecture > Fukushima (0.04)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

Newton Informed Neural Operator for Solving Nonlinear Partial Differential Equations

Neural Information Processing SystemsFeb-18-2026, 08:47:33 GMT

These methods can be broadly categorized into two types: function learning and operator learning approaches. In function learning, the goal is to directly learn the solution.

artificial intelligence, machine learning, operator, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre: Research Report > Experimental Study (0.93)

Industry: Information Technology (0.46)

Technology:

Information Technology > Mathematics of Computing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Stochastic Newton Proximal Extragradient Method

Neural Information Processing SystemsFeb-17-2026, 05:01:21 GMT

However, these methods typically reach superlinear convergence only when the stochastic Hessian noise diminishes, increasing per-iteration costs over time.

artificial intelligence, iteration, machine learning, (20 more...)

Neural Information Processing Systems

Country: North America > United States > Michigan (0.04)

Genre: Research Report > Experimental Study (0.93)

Technology: