Deep Networks Provably Classify Data on Curves

Neural Information Processing Systems 

To our knowledge, this is the first generalization guarantee for deep networks on nonlinear data that depends only on intrinsic properties of the data.