Stochastic Quasi-Newton Optimization in Large Dimensions Including Deep Network Training