Information-Theoretic Generalization Bounds for Stochastic Gradient Descent
