The Optimization Landscape of SGD Across the Feature Learning Strength
