Considerminimizinganempiricalloss min
Neural Information Processing Systems
Many learning tasks, such as regression and classification, are usually framed this way [1]. When N ≫ 1, computing the gradient of the objective in (1) becomes a bottleneck, even if the individual gradients ∇_θ L(z_i, θ) are cheap to evaluate. For a fixed computational budget, it is thus tempting to replace vanilla gradient descent by more iterations using an approximate gradient, obtained from only a few data points. Stochastic gradient descent (SGD; [2]) follows this template.
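As a minimal sketch of this trade-off (all names and hyperparameters below are our own illustration, not from the paper), the following compares the O(Nd) full gradient of (1) with a cheap mini-batch estimate on a least-squares loss L(z_i, θ) = ½(x_iᵀθ − y_i)²:

```python
import numpy as np

# Synthetic regression data: z_i = (x_i, y_i), with N >> 1 observations.
rng = np.random.default_rng(0)
N, d = 10_000, 5
X = rng.standard_normal((N, d))
theta_true = rng.standard_normal(d)
y = X @ theta_true + 0.1 * rng.standard_normal(N)

def full_gradient(theta):
    # Gradient of the objective in (1): averages grad_theta L(z_i, theta)
    # over ALL N points -- costs O(N d) per step, the bottleneck for large N.
    return X.T @ (X @ theta - y) / N

def minibatch_gradient(theta, batch_size=32):
    # Unbiased estimate of the full gradient from a few sampled points,
    # costing only O(batch_size * d) per step.
    idx = rng.integers(0, N, size=batch_size)
    Xb, yb = X[idx], y[idx]
    return Xb.T @ (Xb @ theta - yb) / batch_size

def sgd(steps=2000, lr=0.05):
    # Many cheap, noisy steps instead of few exact ones.
    theta = np.zeros(d)
    for _ in range(steps):
        theta -= lr * minibatch_gradient(theta)
    return theta

theta_hat = sgd()
```

Each SGD step here touches 32 points instead of 10,000, so for the same budget one can afford roughly 300× more iterations than full-batch gradient descent.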