Feature learning is decoupled from generalization in high capacity neural networks

Open in new window