WNGrad: Learn the Learning Rate in Gradient Descent