AITopics | newton direction

newton direction

bc6dc48b743dc5d013b1abaebd2faed2-Paper.pdf

Neural Information Processing Systems | Mar-13-2026, 06:55:04 GMT

algorithm 1, matrix, newton method, (14 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Saudi Arabia (0.04)
North America > United States > Connecticut > New Haven County > New Haven (0.04)
North America > Canada (0.04)
(5 more...)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.35)

d78ca21959823cf8aa8e26d359cf1105-Paper-Conference.pdf

Neural Information Processing Systems | Feb-18-2026, 08:02:51 GMT

artificial intelligence, bayesian inference, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Basel-City > Basel (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry:

Energy (0.68)
Information Technology (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.93)
Information Technology > Data Science (0.93)
(4 more...)

A Stein variational Newton method

Gianluca Detommaso, Tiangang Cui, Youssef Marzouk, Alessio Spantini, Robert Scheichl

Neural Information Processing Systems | Feb-15-2026, 09:43:13 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, approximation, kernel, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts (0.04)
Oceania > Australia (0.04)
North America > Canada > Quebec > Montreal (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.41)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.32)

A Stein variational Newton method

Gianluca Detommaso, Tiangang Cui, Youssef Marzouk, Alessio Spantini, Robert Scheichl

Neural Information Processing Systems | Nov-20-2025, 21:28:00 GMT

We also show how second-order information can lead to more effective choices of kernel.

algorithm, artificial intelligence, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts (0.04)
Oceania > Australia (0.04)
North America > Canada > Quebec > Montreal (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.41)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.32)

Distributed Multitask Reinforcement Learning with Quadratic Convergence

Rasul Tutunov, Dongho Kim, Haitham Bou Ammar

Neural Information Processing Systems | Nov-20-2025, 17:48:08 GMT

Multitask reinforcement learning (MTRL) suffers from scalability issues when the number of tasks or trajectories grows large.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > Quebec > Montreal (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.87)

Wasserstein Gradient Boosting: A Framework for Distribution-Valued Supervised Learning

Neural Information Processing Systems | Oct-10-2025, 18:14:21 GMT

It solves distribution-valued supervised learning, where the output values of the training dataset are probability distributions.

algorithm, wasserstein gradient, wevidential, (12 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Basel-City > Basel (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry:

Energy (0.68)
Information Technology (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.93)
(3 more...)

RSN: Randomized Subspace Newton

Neural Information Processing Systems | Aug-20-2025, 00:39:47 GMT

We develop a randomized Newton method capable of solving learning problems with very high-dimensional feature spaces, a common setting in applications such as medical imaging, genomics and seismology. Our method leverages randomized sketching in a new way, by finding the Newton direction constrained to the space spanned by a random sketch. We develop a simple global linear convergence theory that holds for practically all sketching techniques, which gives practitioners the freedom to design custom sketching approaches suited to particular applications. We perform numerical experiments which demonstrate the efficiency of our method compared to accelerated gradient descent and the full Newton method. Our method can be seen as a refinement and randomized extension of the results of Karimireddy, Stich, and Jaggi [18].
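To make the idea of a Newton direction restricted to a sketched subspace concrete, here is a minimal NumPy sketch. It is an illustration under stated assumptions, not the paper's algorithm verbatim: the names `rsn_step` and `demo`, the Gaussian sketch, the ridge-regularized least-squares objective, and the unit step size are all choices made for this example. On a quadratic objective a unit step exactly minimizes the objective over the sketched subspace, so the objective value should never increase.

```python
import numpy as np

def rsn_step(x, grad, hess, sketch_size, rng, step=1.0):
    """One subspace Newton step: move along S @ lam, where lam solves the
    small system (S^T H S) lam = S^T g. Gaussian S is an assumption here."""
    d = x.shape[0]
    S = rng.standard_normal((d, sketch_size))  # random sketch, shape (d, s)
    g = grad(x)
    H = hess(x)  # formed densely for clarity; in practice only H @ S is needed
    # Least-squares solve acts like a pseudo-inverse if S^T H S is singular.
    lam = np.linalg.lstsq(S.T @ H @ S, S.T @ g, rcond=None)[0]
    return x - step * (S @ lam)

def demo(d=50, n=200, sketch_size=5, iters=300, seed=0):
    """Run the step on a ridge-regularized least-squares problem (hypothetical data)."""
    rng = np.random.default_rng(seed)
    A = rng.standard_normal((n, d))
    b = rng.standard_normal(n)
    reg = 1.0
    H = A.T @ A / n + reg * np.eye(d)  # constant Hessian of the quadratic

    def f(x):
        return 0.5 * np.mean((A @ x - b) ** 2) + 0.5 * reg * (x @ x)

    def grad(x):
        return A.T @ (A @ x - b) / n + reg * x

    def hess(x):
        return H

    x = np.zeros(d)
    values = [f(x)]
    for _ in range(iters):
        x = rsn_step(x, grad, hess, sketch_size, rng)
        values.append(f(x))

    x_star = np.linalg.solve(H, A.T @ b / n)  # exact minimizer, for reference
    return values, f(x_star)
```

Calling `values, f_star = demo()` returns the objective value after each step together with the optimal value, so the gap `values[-1] - f_star` can be compared against the initial gap. Each step only solves a `sketch_size` by `sketch_size` linear system, which is the source of the savings over a full Newton step when the dimension is large.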

algorithm 1, matrix, newton method, (14 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Saudi Arabia (0.04)
North America > United States > Connecticut > New Haven County > New Haven (0.04)
North America > Canada (0.04)
(5 more...)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.35)

Reviews: Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models

Neural Information Processing Systems | Feb-5-2025, 22:54:10 GMT

After rebuttal: I have carefully read the authors' response. Unfortunately, I do not think my concerns are well addressed. See Table 2 in "Regularizing and Optimizing LSTM Language Models" for comparison; (4) the performance of SGD on a single GTX1080 GPU does not tell us how it performs with multiple workers (larger mini-batch size); (5) selecting the learning rate based on the test error is not good practice. In machine learning, hyper-parameters should be selected according to accuracy on a held-out validation set. Considering the above five points, I have decided to keep my score unchanged.

deep learning model, experiment, leader stochastic gradient descent, (10 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

GP-FL: Model-Based Hessian Estimation for Second-Order Over-the-Air Federated Learning

Shayan Mohajer Hamidi, Ali Bereyhi, Saba Asaad, H. Vincent Poor

arXiv.org Artificial Intelligence | Dec-4-2024

Second-order methods are widely adopted to improve the convergence rate of learning algorithms. In federated learning (FL), these methods require the clients to share their local Hessian matrices with the parameter server (PS), which comes at a prohibitive communication cost. A classical solution to this issue is to approximate the global Hessian matrix from first-order information. Unlike in idealized networks, this solution does not perform effectively in over-the-air FL settings, where the PS receives noisy versions of the local gradients. This paper introduces a novel second-order FL framework tailored for wireless channels. The pivotal innovation lies in the PS's capability to directly estimate the global Hessian matrix from the received noisy local gradients via a non-parametric method: the PS models the unknown Hessian matrix as a Gaussian process, and then uses the temporal relation between the gradients and Hessian along with the channel model to find a stochastic estimator for the global Hessian matrix. We refer to this method as Gaussian process-based Hessian modeling for wireless FL (GP-FL) and show that it exhibits a linear-quadratic convergence rate. Numerical experiments on various datasets demonstrate that GP-FL outperforms all classical baseline first- and second-order FL approaches.

artificial intelligence, gradient, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2412.03867

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
North America > United States > District of Columbia > Washington (0.04)
(2 more...)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Wasserstein Gradient Boosting: A General Framework with Applications to Posterior Regression

Takuo Matsubara

arXiv.org Machine Learning | May-15-2024

Gradient boosting is a sequential ensemble method that fits a new base learner to the gradient of the remaining loss at each step. We propose a novel family of gradient boosting, Wasserstein gradient boosting, which fits a new base learner to an exactly or approximately available Wasserstein gradient of a loss functional on the space of probability distributions. Wasserstein gradient boosting returns a set of particles that approximates a target probability distribution assigned at each input. In probabilistic prediction, a parametric probability distribution is often specified on the space of output variables, and a point estimate of the output-distribution parameter is produced for each input by a model. Our main application of Wasserstein gradient boosting is a novel distributional estimate of the output-distribution parameter, which approximates the posterior distribution over the output-distribution parameter determined pointwise at each data point. We empirically demonstrate the superior performance of the probabilistic prediction by Wasserstein gradient boosting in comparison with various existing methods.
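As a point of reference for the first sentence of the abstract, the following is a minimal sketch of ordinary gradient boosting, not the Wasserstein variant the paper proposes. It assumes squared-error loss on one-dimensional inputs with single-threshold regression stumps as base learners; the names `fit_stump` and `gradient_boost` and the default `rounds` and `lr` values are hypothetical choices for illustration. Under squared error, the negative gradient of the loss with respect to the current prediction is simply the residual, so each round fits a stump to the residuals.

```python
import numpy as np

def fit_stump(x, r):
    """Least-squares regression stump on 1-D inputs x for targets r.
    Assumes x contains at least two distinct values."""
    best = None
    for t in np.unique(x)[:-1]:  # candidate thresholds; last value leaves right side empty
        left = x <= t
        lv, rv = r[left].mean(), r[~left].mean()
        err = np.sum((r - np.where(left, lv, rv)) ** 2)
        if best is None or err < best[0]:
            best = (err, t, lv, rv)
    _, t, lv, rv = best
    return lambda z: np.where(z <= t, lv, rv)

def gradient_boost(x, y, rounds=50, lr=0.1):
    """Sequentially fit stumps to the negative gradient of the squared-error loss."""
    base = y.mean()
    pred = np.full_like(y, base, dtype=float)
    learners = []
    losses = [0.5 * np.mean((y - pred) ** 2)]
    for _ in range(rounds):
        residual = y - pred  # negative gradient of 0.5 * (y - pred)^2 w.r.t. pred
        h = fit_stump(x, residual)
        pred = pred + lr * h(x)  # shrunken additive update
        learners.append(h)
        losses.append(0.5 * np.mean((y - pred) ** 2))

    def predict(z):
        return base + lr * sum(h(z) for h in learners)

    return predict, losses
```

For example, with `x = np.linspace(0.0, 1.0, 100)` and `y = np.sin(2 * np.pi * x)`, calling `predict, losses = gradient_boost(x, y)` gives a training loss sequence that does not increase from round to round, since each stump is a least-squares fit to the current residuals and the learning rate lies in (0, 1].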

algorithm, wasserstein gradient, wgboost algorithm, (10 more...)

arXiv.org Machine Learning

2405.09536

Country: