[1609.04836v1] On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima • /r/MachineLearning

@machinelearnbot 

Have you seen the work of Friedlander and Schmidt, and my follow up paper (shameless plug, toot toot)? Though our analysis is restricted to convex functions, there is also a notion of "sharpness" of minima which is appears as the condition number of the problem.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found