Emergence and scaling laws in SGD learning of shallow neural networks
