Train faster, generalize better: Stability of stochastic gradient descent

Open in new window