superlinear convergence rate
Greedy and Random Quasi-Newton Methods with Faster Explicit Superlinear Convergence
In this paper, we follow Rodomanov and Nesterov's work to study quasi-Newton methods. We focus on the common SR1 and BFGS quasi-Newton methods and establish better explicit (local) superlinear convergence rates. First, based on the greedy quasi-Newton update, which greedily selects the direction that maximizes a certain measure of progress, we improve the convergence rate to a condition-number-free superlinear rate. Second, based on the random quasi-Newton update, which selects the direction randomly from a spherically symmetric distribution, we establish the same superlinear convergence rate. Our analysis covers the approximation of a given Hessian matrix, unconstrained quadratic objectives, and general strongly convex, smooth, and strongly self-concordant functions.
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (0.93)
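To make the greedy update in the abstract above concrete, here is a minimal numpy sketch of greedy SR1 steps for approximating a fixed positive-definite matrix A. The progress measure shown, picking the coordinate direction that maximizes u^T G u / u^T A u, follows the Rodomanov–Nesterov greedy framework, but the function names and the toy setup are illustrative assumptions rather than the paper's exact algorithm.

```python
import numpy as np

def greedy_sr1_step(G, A):
    """One greedy SR1 update of the approximation G toward the target A.

    The direction is the coordinate vector maximizing the progress ratio
    (e_i^T G e_i) / (e_i^T A e_i), as in Rodomanov--Nesterov's greedy rule.
    """
    i = np.argmax(np.diag(G) / np.diag(A))   # greedy coordinate choice
    u = np.zeros(A.shape[0])
    u[i] = 1.0
    r = (G - A) @ u                          # residual along the chosen direction
    denom = u @ r
    if abs(denom) < 1e-12:                   # standard SR1 safeguard: skip the update
        return G
    return G - np.outer(r, r) / denom        # rank-one correction toward A

rng = np.random.default_rng(0)
M = rng.standard_normal((5, 5))
A = M @ M.T + 5 * np.eye(5)                  # strongly positive-definite target
G = np.trace(A) * np.eye(5)                  # classical over-estimating initialization
for _ in range(20):
    G = greedy_sr1_step(G, A)
print(np.linalg.norm(G - A))                 # approximation error after 20 greedy steps
```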
Inference Acceleration of Autoregressive Normalizing Flows by Selective Jacobi Decoding
Zhang, Jiaru, Lu, Juanwu, Wang, Ziran, Zhang, Ruqi
Normalizing flows are promising generative models with advantages such as theoretical rigor, analytical log-likelihood computation, and end-to-end training. However, the architectural constraints required to ensure invertibility and tractable Jacobian computation limit their expressive power and practical usability. Recent advances use autoregressive modeling, significantly enhancing expressive power and generation quality. However, such sequential modeling inherently restricts parallel computation during inference, leading to slow generation that impedes practical deployment. In this paper, we first identify that strict sequential dependency during inference is unnecessary for generating high-quality samples. We observe that patches in sequential modeling can also be approximated without strictly conditioning on all preceding patches. Moreover, the models tend to exhibit low dependency redundancy in the initial layer and higher redundancy in subsequent layers. Leveraging these observations, we propose a selective Jacobi decoding (SeJD) strategy that accelerates autoregressive inference through parallel iterative optimization. Theoretical analyses demonstrate the method's superlinear convergence rate and guarantee that the number of iterations required is no greater than that of the original sequential approach. Empirical evaluations across multiple datasets validate the generality and effectiveness of our acceleration technique. Experiments demonstrate substantial speed improvements, with up to 4.7 times faster inference, while preserving generation quality and fidelity.
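As a toy illustration of Jacobi-style decoding for an autoregressive model, the sketch below updates all positions in parallel from the previous draft until a fixed point is reached, which by construction matches the sequential output in at most as many passes as there are positions. The update rule f is an assumed stand-in for a flow's patch generator, and no selective mechanism is modeled; SeJD's actual selection strategy is not shown.

```python
import numpy as np

def sequential_decode(f, n):
    """Standard autoregressive generation: position i conditions on positions < i."""
    x = np.zeros(n)
    for i in range(n):
        x[i] = f(i, x[:i])
    return x

def jacobi_decode(f, n, max_iters=100, tol=1e-8):
    """Jacobi-style decoding: refresh every position in parallel from the
    previous draft. Position i becomes exact after at most i + 1 passes,
    so the fixed point equals the sequential output within n passes."""
    x = np.zeros(n)
    for k in range(max_iters):
        x_new = np.array([f(i, x[:i]) for i in range(n)])  # parallelizable pass
        if np.max(np.abs(x_new - x)) < tol:                # converged draft
            return x_new, k + 1
        x = x_new
    return x, max_iters

# Assumed toy rule: each position is a damped function of its predecessors.
f = lambda i, prefix: 1.0 + 0.3 * (prefix.mean() if len(prefix) else 0.0)
x_seq = sequential_decode(f, 8)
x_jac, n_passes = jacobi_decode(f, 8)
print(np.allclose(x_seq, x_jac), n_passes)  # identical outputs
```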
Incremental Gauss–Newton Methods with Superlinear Convergence Rates
Zhou, Zhiling, Liu, Zhuanghua, Liu, Chengchang, Luo, Luo
This paper addresses the challenge of solving large-scale nonlinear equations with Hölder continuous Jacobians. We introduce a novel Incremental Gauss–Newton (IGN) method with an explicit superlinear convergence rate, which outperforms existing methods that only achieve a linear convergence rate. In particular, we formulate the problem as a nonlinear least-squares problem with finite-sum structure, and our method iterates incrementally, using the information of one component in each round. We also provide a mini-batch extension of the IGN method that attains an even faster superlinear convergence rate. Furthermore, we conduct numerical experiments to show the advantages of the proposed methods.
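The sketch below illustrates the incremental idea on a toy finite-sum nonlinear least-squares problem: a stored linearization (Jacobian and residual) is kept for every component, only one component is refreshed per round, and a Gauss–Newton step is taken on the aggregated model. The cyclic refresh order, the toy system, and the plain normal-equations solve are illustrative assumptions, not the paper's exact IGN scheme.

```python
import numpy as np

def incremental_gauss_newton(residuals, jacobians, x0, n_rounds=100):
    """Minimize (1/2) * sum_i ||r_i(x)||^2 with one component refresh per round.

    residuals[i](x) -> r_i(x) and jacobians[i](x) -> J_i(x). Stale components
    keep their last stored linearization (the core incremental idea).
    """
    m, x = len(residuals), x0.astype(float).copy()
    J = [jacobians[i](x) for i in range(m)]        # stored Jacobians
    r = [residuals[i](x) for i in range(m)]        # stored residuals
    p = [x.copy() for _ in range(m)]               # stored linearization points
    for k in range(n_rounds):
        i = k % m                                  # cyclic refresh (an assumption)
        J[i], r[i], p[i] = jacobians[i](x), residuals[i](x), x.copy()
        # Gauss--Newton step on the aggregated model
        #   sum_i ||r_i + J_i (x_new - p_i)||^2.
        H = sum(Ji.T @ Ji for Ji in J)
        g = sum(Ji.T @ (ri + Ji @ (x - pi)) for Ji, ri, pi in zip(J, r, p))
        x = x - np.linalg.solve(H, g)
    return x

# Toy system with root x = (1, 1) (illustrative, not from the paper).
res = [lambda x: np.array([x[0] ** 2 - 1.0, x[1] - 1.0]),
       lambda x: np.array([x[0] - 1.0, x[0] * x[1] - 1.0])]
jac = [lambda x: np.array([[2 * x[0], 0.0], [0.0, 1.0]]),
       lambda x: np.array([[1.0, 0.0], [x[1], x[0]]])]
print(incremental_gauss_newton(res, jac, np.array([2.0, 0.5])))
```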
Online Learning Guided Curvature Approximation: A Quasi-Newton Method with Global Non-Asymptotic Superlinear Convergence
Jiang, Ruichen, Jin, Qiujiang, Mokhtari, Aryan
Quasi-Newton algorithms are among the most popular iterative methods for solving unconstrained minimization problems, largely due to their favorable superlinear convergence property. However, existing results for these algorithms are limited, as they provide either (i) a global convergence guarantee with only an asymptotic superlinear convergence rate, or (ii) a local non-asymptotic superlinear rate when the initial point and the initial Hessian approximation are chosen properly. In particular, no current analysis for quasi-Newton methods guarantees global convergence with an explicit superlinear convergence rate. In this paper, we close this gap and present the first globally convergent quasi-Newton method with an explicit non-asymptotic superlinear convergence rate. Unlike classical quasi-Newton methods, we build our algorithm upon the hybrid proximal extragradient method and propose a novel online learning framework for updating the Hessian approximation matrices. Specifically, guided by the convergence analysis, we formulate the Hessian approximation update as an online convex optimization problem in the space of matrices, and we relate the bounded regret of the online problem to the superlinear convergence of our method.
- Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)
- Information Technology > Enterprise Applications > Human Resources > Learning Management (0.62)
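To give a flavor of the online-learning viewpoint in the last abstract, the sketch below casts Hessian approximation as online least squares over matrices: at each round the learner pays a loss measuring how badly the current matrix B maps the displacement s_t to the gradient difference y_t, then takes an online gradient step. The quadratic loss, fixed step size, and absence of a projection step are simplifying assumptions and differ from the paper's exact formulation.

```python
import numpy as np

def online_hessian_learning(pairs, B0, lr=0.5):
    """Online convex optimization over matrices for Hessian approximation.

    Each round observes (s_t, y_t) with y_t ~= H s_t and pays the loss
    loss_t(B) = ||B s_t - y_t||^2 / ||s_t||^2, followed by an online
    gradient step (a simplified stand-in for the paper's exact update).
    """
    B = B0.copy()
    for s, y in pairs:
        resid = B @ s - y
        grad = 2.0 * np.outer(resid, s) / (s @ s)  # gradient of loss_t at B
        B -= lr * grad                              # online gradient step
    return B

# Assumed toy stream: (s_t, y_t = H s_t) for a fixed symmetric H; in the
# method itself these pairs come from the algorithm's own iterates.
rng = np.random.default_rng(1)
M = rng.standard_normal((4, 4))
H = (M + M.T) / 2 + 4 * np.eye(4)
pairs = [(s, H @ s) for s in rng.standard_normal((200, 4))]
B = online_hessian_learning(pairs, np.eye(4))
print(np.linalg.norm(B - H))  # the learned B approaches H along the stream
```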