Active Bias: Training More Accurate Neural Networks by Emphasizing High Variance Samples

Haw-Shiuan Chang, Erik Learned-Miller, Andrew McCallum

Jan-6-2018, arXiv.org Machine Learning

Self-paced learning and hard example mining re-weight training instances to improve learning accuracy. This paper presents two improved alternatives based on lightweight estimates of sample uncertainty in stochastic gradient descent (SGD): the variance in predicted probability of the correct class across iterations of mini-batch SGD, and the proximity of the correct class probability to the decision threshold. Extensive experimental results on six datasets show that our methods reliably improve accuracy in various network architectures, including additional gains on top of other popular training techniques, such as residual learning, momentum, ADAM, batch normalization, dropout, and distillation.
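The two uncertainty estimates named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `variance_weights` and `threshold_weights`, the smoothing constant `eps`, the default threshold of 0.5, and the normalization to a mean weight of 1 are all assumptions made for illustration. The abstract does not give the exact weighting formulas.

```python
import numpy as np


def variance_weights(prob_history, eps=1e-3):
    """Weight samples by how much p(correct class) fluctuated during training.

    prob_history: array-like of shape (n_samples, n_iterations), the predicted
    probability of the correct class recorded at earlier SGD iterations.
    eps is an assumed smoothing constant so stable samples keep a small weight.
    """
    prob_history = np.asarray(prob_history, dtype=float)
    raw = prob_history.std(axis=1) + eps
    return raw / raw.mean()  # assumed normalization: average weight is 1


def threshold_weights(current_prob, threshold=0.5, eps=1e-3):
    """Weight samples by how close p(correct class) is to a decision threshold.

    The closeness measure 1 - |p - threshold| is an illustrative choice only.
    """
    current_prob = np.asarray(current_prob, dtype=float)
    raw = 1.0 - np.abs(current_prob - threshold) + eps
    return raw / raw.mean()


# Three samples: confidently right, fluctuating, confidently wrong.
history = [[0.9, 0.9, 0.9], [0.2, 0.8, 0.5], [0.1, 0.1, 0.1]]
print(variance_weights(history))             # the fluctuating sample gets the largest weight
print(threshold_weights([0.5, 0.95, 0.05]))  # the sample at the threshold gets the largest weight
```

In a training loop, weights like these would typically multiply the per-sample loss before averaging over the mini-batch, so that uncertain samples contribute more to each gradient step.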

artificial intelligence, deep learning, machine learning

Country:
- North America > United States > Massachusetts (0.46)

Genre:
- Research Report (1.00)

Industry:
- Education (0.67)
- Government (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks > Deep Learning (0.68)
  - Statistical Learning > Gradient Descent (0.55)
