Alternating Gradient Flows: A Theory of Feature Learning in Two-layer Neural Networks

Open in new window