A Theory of Feature Learning