Variational Stochastic Gradient Descent for Deep Neural Networks