Stochastic Gradient Descent and Anomaly of Variance-flatness Relation in Artificial Neural Networks

Open in new window