AITopics | newton

Collaborating Authors

newton

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

One-step differentiation of iterative algorithms

Neural Information Processing SystemsFeb-17-2026, 23:00:39 GMT

For iterative algorithms, implicit differentiation alleviates this issue but requires custom implementation of Jacobian evaluation. In this paper, we study one-step differentiation, also known as Jacobian-free backpropagation, a method as easy as automatic differentiation and as efficient as implicit differentiation for fast algorithms (e.g., superlinear

artificial intelligence, differentiation, machine learning, (18 more...)

Neural Information Processing Systems

Country:

Europe > France > Occitanie > Haute-Garonne > Toulouse (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > France > Provence-Alpes-Côte d'Azur > Alpes-Maritimes > Nice (0.04)
Europe > Finland > Uusimaa > Helsinki (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

b1b20d09041289e6c3fbb81850c5da54-Paper.pdf

Neural Information Processing SystemsFeb-13-2026, 16:15:59 GMT

AIDE: Fastand Communication Efficient Distributed Optimization.arXive-prints,

artificial intelligence, machine learning, newton, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > New York > New York County > New York City (0.05)
North America > United States > California > Alameda County > Berkeley (0.05)
Oceania > Australia > New South Wales > Sydney (0.04)
(4 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.50)

Add feedback

Almost Surely Stable Deep Dynamics

Neural Information Processing SystemsFeb-10-2026, 17:17:00 GMT

However, wecanmake(9)moretractable (bothforpredictionandtraining) byreducingittoa1-dimensional root-findingproblem: Find ?

artificial intelligence, lyapunovfunction, machine learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > New York (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

Add feedback

Symmetry Teleportationfor Accelerated Optimization

Neural Information Processing SystemsFeb-9-2026, 14:26:43 GMT

Consider GofthelossL(w), meaning g 2G, L(w)= L(g w).

artificial intelligence, conferenceon machine learning, ininternational conferenceon machine learning, (9 more...)

Neural Information Processing Systems

Country: North America > United States > California > San Diego County > San Diego (0.05)

Industry: Government > Regional Government (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.50)

Add feedback

52aaa62e71f829d41d74892a18a11d59-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 16:46:25 GMT

algorithm, projection, projection problem, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Indiana (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada (0.04)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Data Science (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.46)

Add feedback

0b2b199fdd52089b31d3a0120e400b2a-Paper-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 17:06:25 GMT

They derived a parallel form of Newton's method to solve the fixed-point problem and achieved significant speedups over sequential evaluation.

artificial intelligence, convergence, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > Canada > Ontario > Toronto (0.04)
(2 more...)

Genre: Research Report (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

How to Use Physics to Escape an Ice Bowl

WIREDFeb-1-2026, 12:00:00 GMT

Here are three smart tricks, based on an understanding of frictional forces, to beat a slippery slope. I don't know who invented this crazy challenge, but the idea is to put someone in a carved-out ice bowl and see if they can get out. The bowl is shaped like the inside of a sphere, so the higher up the sides you go, the steeper it gets. If you think an icy sidewalk is slippery, try going uphill on an icy sidewalk. What do you do when faced with a problem like this?

artificial intelligence, frictional force, normal force, (16 more...)

WIRED

Country:

North America > United States (0.29)
Europe (0.29)

Industry: Leisure & Entertainment > Sports > Football (0.62)

Technology: Information Technology > Artificial Intelligence (0.48)

Add feedback

Distributed Newton Can Communicate Less and Resist Byzantine Workers

Neural Information Processing SystemsDec-24-2025, 16:20:49 GMT

We develop a distributed second order optimization algorithm that is communication-efficient as well as robust against Byzantine failures of the worker machines. We propose an iterative approximate Newton-type algorithm, where the worker machines communicate \emph{only once} per iteration with the central machine. This is in sharp contrast with the state-of-the-art distributed second order algorithms like GIANT \cite{giant}, DINGO\cite{dingo}, where the worker machines send (functions of) local gradient and Hessian sequentially; thus ending up communicating twice with the central machine per iteration. Furthermore, we employ a simple norm based thresholding rule to filter-out the Byzantine worker machines. We establish the linear-quadratic rate of convergence of our proposed algorithm and establish that the communication savings and Byzantine resilience attributes only correspond to a small statistical error rate for arbitrary convex loss functions. To the best of our knowledge, this is the first work that addresses the issue of Byzantine resilience in second order distributed optimization. Furthermore, we validate our theoretical results with extensive experiments on synthetically generated and benchmark LIBSVM \cite{libsvm} data-set and demonstrate convergence guarantees.

name change, proceedings, resist byzantine worker, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

Transformers Learn to Achieve Second-Order Convergence Rates for In-Context Linear Regression

Neural Information Processing SystemsMay-27-2025, 13:17:33 GMT

Transformers excel at *in-context learning* (ICL)---learning from demonstrations without parameter updates---but how they do so remains a mystery. Recent work suggests that Transformers may internally run Gradient Descent (GD), a first-order optimization method, to perform ICL. In this paper, we instead demonstrate that Transformers learn to approximate second-order optimization methods for ICL. For in-context linear regression, Transformers share a similar convergence rate as *Iterative Newton's Method*, both *exponentially* faster than GD. Empirically, predictions from successive Transformer layers closely match different iterations of Newton's Method linearly, with each middle layer roughly computing 3 iterations; thus, Transformers and Newton's method converge at roughly the same rate.

in-context linear regression, second-order convergence rate, transformer, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.65)

Add feedback

Filters

Collaborating Authors

newton

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

One-step differentiation of iterative algorithms

b1b20d09041289e6c3fbb81850c5da54-Paper.pdf

Almost Surely Stable Deep Dynamics

Symmetry Teleportationfor Accelerated Optimization

52aaa62e71f829d41d74892a18a11d59-Paper.pdf

0b2b199fdd52089b31d3a0120e400b2a-Paper-Conference.pdf

How to Use Physics to Escape an Ice Bowl

Distributed Newton Can Communicate Less and Resist Byzantine Workers

77d52754ff6b2de5a5d96ee921b6b3cd-Paper-Conference.pdf

Transformers Learn to Achieve Second-Order Convergence Rates for In-Context Linear Regression