Coordinate Descent Converges Faster with the Gauss-Southwell Rule Than Random Selection