How Feature Learning Can Improve Neural Scaling Laws