Bad Global Minima Exist and SGD Can Reach Them
–Neural Information Processing Systems
SGD until 100% accuracy is achieved, in four different settings: 1. Random initialization + Training with true labels.
Neural Information Processing Systems
Oct-3-2025, 01:32:45 GMT