Implicit Bias in Deep Linear Classification: Initialization Scale vs Training Accuracy
