Bad Global Minima Exist and SGD Can Reach Them

Neural Information Processing Systems 

SGD until 100% accuracy is achieved, in four different settings: 1. Random initialization + Training with true labels.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found