Neural nets - learning with total gradient rather than stochastic gradients?
The gradient estimated from just a mini-batch is usually good enough to point you in a descent direction, so it rarely makes sense to pay for a full pass over the data to get a marginally better estimate. Plus, the noise introduced by the mini-batch approximation can act as a regularizer. Here is an interesting paper that performs statistical tests during optimization: whenever the gradient estimate is not statistically significant, more samples are added to the mini-batch.
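For a rough sense of how such a scheme works, here is a minimal sketch on a toy least-squares problem. It uses the standard "norm test" idea (grow the batch when the sampling variance of the mini-batch gradient is large relative to its squared norm), which may not be the exact test in the paper; the threshold `theta` and all names below are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy problem: least squares, loss_i(w) = 0.5 * (x_i @ w - y_i)^2
n, d = 10_000, 5
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = X @ w_true + 0.1 * rng.normal(size=n)

def per_sample_grads(w, idx):
    """Gradient of each sample's loss: grad_i = (x_i @ w - y_i) * x_i."""
    r = X[idx] @ w - y[idx]          # residuals, shape (b,)
    return r[:, None] * X[idx]       # shape (b, d)

w = np.zeros(d)
lr, batch = 0.05, 32
theta = 1.0                          # significance threshold (assumed value)

for step in range(200):
    idx = rng.choice(n, size=batch, replace=False)
    G = per_sample_grads(w, idx)
    g = G.mean(axis=0)               # mini-batch gradient estimate

    # Norm test: estimate the variance of the mean gradient. If the
    # noise swamps the signal, the descent direction is not trustworthy,
    # so add more samples to the mini-batch instead of taking a step.
    var_of_mean = G.var(axis=0, ddof=1).sum() / batch
    if var_of_mean > theta**2 * np.dot(g, g) and batch < n:
        batch = min(2 * batch, n)
        continue

    w -= lr * g

print("final batch size:", batch)
print("distance to w_true:", np.linalg.norm(w - w_true))
```

Note how the batch naturally grows as optimization converges: near a minimum the true gradient shrinks toward zero, so a small sample can no longer distinguish it from noise and the test demands more data.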
Sep-12-2016, 20:15:51 GMT