Control Batch Size and Learning Rate to Generalize Well: Theoretical and Empirical Evidence

Fengxiang He, Tongliang Liu, Dacheng Tao

Neural Information Processing Systems 

Let S be the … independently … {1, 2, ..., N}, where N is … Then, similar …, S is … $\hat{g}_S(\theta(t)) = \nabla_{\theta(t)} \hat{R}(\theta(t)) = \frac{1}{|S|} \sum …$
