SurprisingInstabilitiesinTrainingDeepNetworksand aTheoreticalAnalysis