A Linear Speedup Analysis of Distributed Deep Learning with Sparse and Quantized Communication