On the Convergence of Stochastic Gradient Descent in Low-precision Number Formats