ScaleCom: Scalable Sparsified Gradient Compression for Communication-Efficient Distributed Training