On the Quality of the Initial Basin in Overspecified Neural Networks