AITopics | Gradient Descent


Gradient Descent


Appendix to: Training Uncertainty-Aware Classifiers with Conformalized Deep Learning Bat-Sheva Einbinder Yaniv Romano Matteo Sesia Yanfei Zhou A1 Additional methodological details

Neural Information Processing Systems Aug-16-2025, 21:19:10 GMT

Authors listed in alphabetical order. Figure A1: Schematic of the proposed uncertainty-aware deep classification learning algorithm. This procedure is summarized in Algorithm A1, which is a more technical version of Algorithm 1. [...] This section explains the implementation of the hybrid benchmark method applied in Section 4. This benchmark is based on a loss function designed to incentivize the trained model to produce the smallest possible conformal prediction sets with the desired coverage (e.g., 90% if [...] To facilitate the exposition of our analysis, we begin by introducing some helpful notation. The first part of the proof is standard and proceeds as follows. A3.1 Details about experiments with synthetic data. The conditional data-generating distribution of Y given X is given by: P[Y | X] = [...] Our method (resp., the hybrid method) is applied using [...] The hybrid loss model is trained via stochastic gradient descent for 4000 epochs with learning rate 0.01, decreased by a factor of 10 halfway through training.

artificial intelligence, conformal prediction, machine learning, (17 more...)


Country:

North America > United States > California > Los Angeles County > Los Angeles (0.28)
Asia > Middle East > Israel (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.69)


Non-convex online learning via algorithmic equivalence

Neural Information Processing Systems Aug-16-2025, 19:41:32 GMT

We study an algorithmic equivalence technique between non-convex gradient descent and convex mirror descent.

artificial intelligence, descent, machine learning, (18 more...)


Country: Asia > Middle East > Jordan (0.04)

Industry: Education > Educational Setting > Online (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.53)


8b40b4984e6c09ee49333ddd2dc719d4-Paper-Conference.pdf

Neural Information Processing Systems Aug-16-2025, 19:41:29 GMT

artificial intelligence, descent, machine learning, (17 more...)


Country: Asia > Middle East > Jordan (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.33)


d714d2c5a796d5814c565d78dd16188d-Paper.pdf

Neural Information Processing Systems Aug-16-2025, 16:15:33 GMT

Sampling-based methods promise scalability improvements when paired with stochastic gradient descent in training Graph Convolutional Networks (GCNs).

azy gcn, gcn, node, (15 more...)


Country:

North America > United States > Pennsylvania (0.05)
North America > Canada (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)


Simple Stochastic and Online Gradient Descent Algorithms for Pairwise Learning Zhenhuan Yang 1 Yunwen Lei 2 Puyu Wang 3 Tianbao Yang

Neural Information Processing Systems Aug-16-2025, 15:15:19 GMT

Pairwise learning refers to learning tasks where the loss function depends on a pair of instances.

algorithm, artificial intelligence, machine learning, (15 more...)


Country:

Asia > China > Hong Kong (0.04)
North America > United States > New York > Albany County > Albany (0.04)
North America > United States > Iowa > Johnson County > Iowa City (0.04)
(2 more...)

Industry: Education > Educational Setting > Online (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.52)


Decentralized Accelerated Proximal Gradient Descent Haishan Ye 1 Ziang Zhou 1 Luo

Neural Information Processing Systems Aug-16-2025, 15:12:56 GMT

In this paper, we study the decentralized composite optimization problem with a non-smooth regularization term.

algorithm, complexity, inequality, (14 more...)


Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
Asia > China > Hong Kong (0.04)
North America > Canada > British Columbia > Vancouver (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.51)


Decentralized Accelerated Proximal Gradient Descent Haishan Ye 1 Ziang Zhou 1 Luo

Neural Information Processing Systems Aug-16-2025, 15:12:49 GMT

In this paper, we study the decentralized composite optimization problem with a non-smooth regularization term.

algorithm, complexity, convergence rate, (13 more...)


Country:

Asia > China > Guangdong Province > Shenzhen (0.05)
Asia > China > Hong Kong (0.04)
North America > Canada > British Columbia > Vancouver (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.52)


Convergence and Alignment of Gradient Descent with Random Backpropagation Weights Ganlin Song Ruitu Xu John Lafferty Department of Statistics and Data Science

Neural Information Processing Systems Aug-16-2025, 14:27:01 GMT

Stochastic gradient descent with backpropagation is the workhorse of artificial neural networks. It has long been recognized that backpropagation fails to be a biologically plausible algorithm. Fundamentally, it is a non-local procedure: updating one neuron's synaptic weights requires knowledge of synaptic weights or receptive fields of downstream neurons. This limits the use of artificial neural networks as a tool for understanding the biological principles of information processing in the brain. Lillicrap et al. (2016) propose a more biologically plausible "feedback alignment" algorithm that uses random and fixed backpropagation weights, and show promising simulations. In this paper we study the mathematical properties of the feedback alignment procedure by analyzing convergence and alignment for two-layer networks under squared error loss. In the overparameterized setting, we prove that the error converges to zero exponentially fast, and also that regularization is necessary in order for the parameters to become aligned with the random backpropagation weights. Simulations are given that are consistent with this analysis and suggest further generalizations. These results contribute to our understanding of how biologically plausible algorithms might carry out weight learning in a manner different from Hebbian learning, with performance that is comparable with the full non-local backpropagation algorithm.
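
The feedback alignment idea described in this abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the ReLU activation, the 1/sqrt(hidden) output scaling, the full-batch updates, the unregularized loss, and all names (`train_feedback_alignment`, `hidden`, `lr`, `steps`) are assumptions made for illustration. The only point it shows is that the hidden-layer update uses a fixed random vector `b` where backpropagation would use the output weights `beta`.

```python
import numpy as np

def train_feedback_alignment(X, y, hidden=512, lr=0.1, steps=500, seed=0):
    """Two-layer network f(x) = beta . relu(W x) / sqrt(hidden), squared error loss.

    Hypothetical sketch: the backward pass for W uses fixed random
    feedback weights b in place of the output weights beta.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W = rng.standard_normal((hidden, d))   # first-layer weights (trained)
    beta = rng.standard_normal(hidden)     # output weights (trained)
    b = rng.standard_normal(hidden)        # feedback weights (random, never updated)
    scale = 1.0 / np.sqrt(hidden)
    losses = []
    for _ in range(steps):
        pre = X @ W.T                      # pre-activations, shape (n, hidden)
        h = np.maximum(pre, 0.0)           # ReLU activations
        err = scale * (h @ beta) - y       # residuals, shape (n,)
        losses.append(0.5 * float(err @ err))
        # Output layer: ordinary gradient of the squared error.
        grad_beta = scale * (h.T @ err)
        # Hidden layer: backpropagation would multiply err by beta here;
        # feedback alignment multiplies by the fixed random vector b instead.
        delta = (err[:, None] * b[None, :]) * (pre > 0)
        grad_W = scale * (delta.T @ X)
        beta -= lr * grad_beta
        W -= lr * grad_W
    return W, beta, b, losses

# Small overparameterized toy problem: 8 unit-norm inputs, 512 hidden units.
rng = np.random.default_rng(1)
X = rng.standard_normal((8, 16))
X /= np.linalg.norm(X, axis=1, keepdims=True)
y = rng.standard_normal(8)
W, beta, b, losses = train_feedback_alignment(X, y)
print(f"initial loss {losses[0]:.4f}, final loss {losses[-1]:.6f}")
```

Replacing `b` with `beta` in the `delta` line recovers standard backpropagation for this network, which makes the two update rules easy to compare on the same data.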

alignment, artificial intelligence, machine learning, (16 more...)


Country: