On the Ineffectiveness of Variance Reduced Optimization for Deep Learning

Feb-12-2026, 19:32:42 GMT–Neural Information Processing Systems

SVR methods use control variates to reduce the variance of the traditional stochastic gradient descent (SGD) estimate f0i(w) of the full gradient f0(w). Control variates are a classical technique for reducing the variance of a stochastic quantity without introducing bias. Say we have some random variable X.

artificial intelligence, machine learning, variance reduction, (15 more...)

Neural Information Processing Systems

Feb-12-2026, 19:32:42 GMT

Conferences PDF

Add feedback

Country:
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.05)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks (0.69)
  - Statistical Learning > Gradient Descent (0.55)

Duplicate Docs Excel Report

Title
On the Ineffectiveness of Variance Reduced Optimization for Deep Learning

Similar Docs Excel Report more

Title	Similarity	Source
None found