Reviews: A Linear Speedup Analysis of Distributed Deep Learning with Sparse and Quantized Communication

Oct-7-2024, 05:47:38 GMT–Neural Information Processing Systems

Especially the ones related to experiments, and parameter tuning. Since the authors appropriately responded to some of my key criticisms I will upgrade my score from 5 to 6. original review This paper analyzes the convergence rate of distributed mini-batch SGD using sparse and quantized communication with application to deep learning. Based on the analysis, it proposes combining sparse and quantized communication to further reduce the communication cost that burdens the wall clock runtime in distributed setups. The paper is generally well written. The convergence analysis does appear to make sense, and the proposed combination of sparsification and quantization seems to save runtime a little bit with proper parameter tuning.

communication, linear speedup analysis, sparse and quantized communication, (7 more...)

Neural Information Processing Systems

Oct-7-2024, 05:47:38 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.92)