[1609.04836v1] On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima • /r/MachineLearning

Open in new window