Is normalization indispensable for training deep neural network?

Open in new window