Communication-Efficient Large-Scale Distributed Deep Learning: A Comprehensive Survey