AITopics | Gradient Descent

Gradient Descent

ef9280fbc5317f17d480e4d4f61b3751-Paper.pdf

Neural Information Processing Systems | Aug-17-2025, 05:16:59 GMT

artificial intelligence, arxiv preprint arxiv, machine learning, (12 more...)

Neural Information Processing Systems

Country:

Europe > Russia (0.14)
Asia > Russia (0.14)
Asia > Middle East > Saudi Arabia (0.04)
(2 more...)

Genre: Research Report (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.31)

ef48e3ef07e359006f7869b04fa07f5e-Paper.pdf

Neural Information Processing Systems | Aug-17-2025, 05:08:05 GMT

artificial intelligence, bayesian inference, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.46)

Averaging on the Bures-Wasserstein manifold: dimension-free convergence of gradient descent

Neural Information Processing Systems | Aug-17-2025, 02:22:51 GMT

We study first-order optimization algorithms for computing the barycenter of Gaussian distributions with respect to the optimal transport metric.

artificial intelligence, machine learning, optimization problem, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Rhode Island > Providence County > Providence (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
(7 more...)

Industry: Government > Regional Government (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.50)

939bb847ebfd14c6e4d3b5705e562054-Paper-Conference.pdf

Neural Information Processing Systems | Aug-17-2025, 01:30:50 GMT

artificial intelligence, machine learning, neural network, (14 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Europe > Switzerland > Vaud > Lausanne (0.05)
Africa > Middle East > Tunisia > Ben Arous Governorate > Ben Arous (0.04)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.30)

Estimating Training Data Influence by Tracing Gradient Descent

Neural Information Processing Systems | Aug-17-2025, 01:28:23 GMT

The method is general.

checkpoint, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.46)
North America > Canada (0.28)

Industry: Banking & Finance (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.51)

Globally Convergent Policy Search for Output Estimation

Neural Information Processing Systems | Aug-17-2025

We introduce the first direct policy search algorithm which provably converges to the globally optimal dynamic filter for the classical problem of predicting the outputs of a linear dynamical system, given noisy, partial observations. Despite the ubiquity of partial observability in practice, theoretical guarantees for direct policy search algorithms, one of the backbones of modern reinforcement learning, have proven difficult to achieve. This is primarily due to the degeneracies which arise when optimizing over filters that maintain an internal state. In this paper, we provide a new perspective on this challenging problem based on the notion of informativity, which intuitively requires that all components of a filter's internal state are representative of the true state of the underlying dynamical system. We show that informativity overcomes the aforementioned degeneracy. Specifically, we propose a regularizer which explicitly enforces informativity, and establish that gradient descent on this regularized objective, combined with a "reconditioning step", converges to the globally optimal cost at an O(1/T) rate.

artificial intelligence, gradient descent, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > New Jersey (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.50)

Supplementary Material For Stochastic Multiple Target Sampling Gradient Descent

Neural Information Processing Systems | Aug-16-2025, 23:00:18 GMT

This supplementary material consists of the following sections: Appendix 1 contains the proofs and derivations of our theory development. [...] As a consequence, we obtain the conclusion of Equation (1). By choosing u to be a one-hot vector at i, we obtain the conclusion of Lemma 1. 1.3 Derivations for the matrix U's formulation in Equation (3): [...] As a consequence, we obtain the conclusion of Equation (3). 1.4 Proof of Theorem 2: [...] In this experiment, the three target distributions are created as presented in the main paper. Results are averaged over 5 runs. We take the best checkpoint in each approach based on the validation score.

artificial intelligence, machine learning, particle, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.40)

Stochastic Multiple Target Sampling Gradient Descent

Neural Information Processing Systems | Aug-16-2025, 23:00:15 GMT

A natural question then arises: "Can we derive a probabilistic ..."

artificial intelligence, machine learning, target distribution, (16 more...)

Neural Information Processing Systems

Country:

Oceania > Australia (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
Asia > Vietnam (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.52)

Zeroth-Order Hard-Thresholding: Gradient Error vs. Expansivity | William de Vazelhes

Neural Information Processing Systems | Aug-16-2025, 22:39:47 GMT

Hard-thresholding gradient descent is a dominant technique to solve this problem.
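
As a rough illustration of the hard-thresholding gradient descent mentioned in the summary above, the sketch below alternates a gradient step with a projection that keeps only the k largest-magnitude coordinates. It assumes the problem in question is sparsity-constrained least squares, and it uses exact gradients rather than the zeroth-order estimates the paper's title refers to. The names `hard_threshold`, `gradient`, and `iht`, the step size, and the iteration count are illustrative choices, not taken from the paper.

```python
def hard_threshold(x, k):
    """Keep the k largest-magnitude entries of x and zero out the rest."""
    order = sorted(range(len(x)), key=lambda i: abs(x[i]), reverse=True)
    keep = set(order[:k])
    return [x[i] if i in keep else 0.0 for i in range(len(x))]


def gradient(A, b, x):
    """Gradient of 0.5 * ||A x - b||^2, which is A^T (A x - b)."""
    residual = [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) - b_i
                for row, b_i in zip(A, b)]
    return [sum(A[i][j] * residual[i] for i in range(len(A)))
            for j in range(len(x))]


def iht(A, b, k, step=0.5, iters=200):
    """Hard-thresholding gradient descent (assumed setup: least squares, at most k nonzeros)."""
    x = [0.0] * len(A[0])
    for _ in range(iters):
        g = gradient(A, b, x)
        # Plain gradient step, then project back onto the set of k-sparse vectors.
        x = hard_threshold([x_j - step * g_j for x_j, g_j in zip(x, g)], k)
    return x
```

For instance, `iht([[1.0, 0.0], [0.0, 1.0]], [3.0, 0.1], k=1)` retains only the first coordinate, since the second never ranks among the top k after a step. The fixed step size is only reasonable here because the example matrix is well conditioned; other problems would need a step chosen for their curvature.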

algorithm, artificial intelligence, machine learning, (14 more...)

Neural Information Processing Systems

Country:

Europe > Sweden > Stockholm > Stockholm (0.04)
Europe > Russia (0.04)
Asia > Russia (0.04)
(2 more...)

Genre: Research Report (0.46)

Industry: Information Technology (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.48)