Importance Sampling for Stochastic Gradient Descent in Deep Neural Networks