The Step Decay Schedule: A Near Optimal, Geometrically Decaying Learning Rate Procedure For Least Squares Rong Ge1, Sham M. Kakade

Neural Information Processing Systems 

In contrast, the behavior of SGD's final iterate has received much less

Similar Docs  Excel Report  more

TitleSimilaritySource
None found