S2 Reducer: High-Performance Sparse Communication to Accelerate Distributed Deep Learning