AITopics | Gradient Descent

However, it remains unclear if we can further improve the convergence rate when the assumptions for the function in the population level also hold for each random realization almost surely (e.g., Lipschitzness of each realization of the stochastic gradient).

machine learning, natural language, optimization, (17 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (1.00)

Industry: Education (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.49)

Add feedback

Small steps no more: Global convergence of stochastic gradient bandits for arbitrary learning rates

Neural Information Processing SystemsFeb-16-2026, 10:16:52 GMT

In particular, we establish the surprising result that: F or any constant learning rate η > 0, the stochastic gradient bandit algorithm is guaranteed to converge to the globally optimal policy almost surely.

artificial intelligence, convergence, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > France (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.85)

Add feedback

Behavior Alignment via Reward Function Optimization Dhawal Gupta University of Massachusetts Y ash Chandak

Neural Information Processing SystemsFeb-16-2026, 08:13:30 GMT

Designing reward functions for efficiently guiding reinforcement learning (RL) agents toward specific behaviors is a complex task.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts (0.40)
North America > Canada > Alberta (0.14)
Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
(5 more...)

Genre: Research Report > New Finding (0.46)

Industry: Energy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

Add feedback

Stable Nonconvex-Nonconcave Training via Linear Interpolation

Neural Information Processing SystemsFeb-16-2026, 03:15:26 GMT

By replacing the inner optimizer in RAPP we rediscover the family of Lookahead algorithms for which we establish convergence in cohypomonotone problems even when the base optimizer is taken to be gradient descent ascent.

artificial intelligence, convergence, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
Asia > Middle East > Israel (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.34)

Add feedback

Online Performative Gradient Descent for Learning Nash Equilibria in Decision-Dependent Games Zihan Zhu Duke University Ethan X. Fang Duke University Zhuoran Yang Yale University

Neural Information Processing SystemsFeb-15-2026, 23:56:38 GMT

We focus on finding the Nash equilibrium of decision-dependent games in the bandit feedback setting.

artificial intelligence, decision-dependent game, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Switzerland > Basel-City > Basel (0.04)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.84)

Add feedback

The Sample Complexity of Gradient Descent in Stochastic Convex Optimization Roi Livni School of Electrical Engineering Tel Aviv University rlivni@tauex.tau.ac.il

Neural Information Processing SystemsFeb-15-2026, 22:13:02 GMT

But, also, it has become focus of study because it is one of few theoretical settings that exhibit overparameterized learning .

artificial intelligence, machine learning, oracle, (16 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.40)

Genre: Research Report > Experimental Study (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.50)

Add feedback

Transformers learn to implement preconditioned gradient descent for in-context learning

Neural Information Processing SystemsFeb-15-2026, 20:28:17 GMT

Several recent works demonstrate that transformers can implement algorithms like gradient descent. By a careful construction of weights, these works show that multiple layers of transformers are expressive enough to simulate iterations of gradient descent.

artificial intelligence, machine learning, transformer, (19 more...)

Neural Information Processing Systems

Country: