Weight Initialization without Local Minima in Deep Nonlinear Neural Networks