Emergence and scaling laws in SGDlearning of shallow neural networks

Open in new window