The Convergence of Sparsified Gradient Methods Dan Alistarh Cédric Renggli IST Austria