Reviews: The Convergence Rate of Neural Networks for Learned Functions of Different Frequencies

Neural Information Processing Systems 

It finds that lower frequencies learn first, and finds that biases allow for learning of odd frequencies. The restriction to spherical data is limiting, but the analysis and conclusions (particularly the rates of convergence) are novel and interesting.