Towards a mathematical understanding of learning from few examples with nonlinear feature maps