Note on Learning Rate Schedules for Stochastic Optimization

Darken, Christian, Moody, John E.

Neural Information Processing Systems 

We present and compare learning rate schedules for stochastic gradient descent, a general algorithm which includes LMS, on-line backpropagation, and k-means clustering as special cases. We introduce "search-then-converge" type schedules which outperform the classical constant and "running average" (1/t) schedules both in speed of convergence and quality of solution.
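The three families of schedules the abstract names can be sketched as follows. This is a minimal illustration, not the paper's exact parameterization: the search-then-converge form below, eta0 / (1 + t/tau), is its simplest instance, and the parameter names eta0 and tau are ours. It stays roughly constant (the "search" phase) while t is small relative to tau, then decays like eta0 * tau / t (the "converge" phase) for large t.

```python
def eta_constant(t, eta0=0.1):
    # Classical constant schedule: the rate never decays.
    return eta0

def eta_running_average(t, eta0=0.1):
    # Classical "running average" (1/t) schedule: eta_t = eta0 / (t + 1).
    return eta0 / (t + 1)

def eta_search_then_converge(t, eta0=0.1, tau=100.0):
    # Search-then-converge (simplest form, illustrative parameterization):
    # roughly eta0 while t << tau, asymptotically eta0 * tau / t for t >> tau.
    return eta0 / (1.0 + t / tau)
```

With these settings, at t = 1000 the 1/t schedule has already shrunk the rate by three orders of magnitude, while the search-then-converge schedule is only about a factor of eleven below eta0, which is the informal sense in which it "searches" longer before "converging".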