Feature Learning in L2-regularized DNNs: Attraction/Repulsion and Sparsity

Neural Information Processing Systems 

NTK regime) without feature learning and an active regime where features are learned.