bidirectional compression
Preserved central model for faster bidirectional compression in distributed settings
Philippenko, Constantin, Dieuleveut, Aymeric
We develop a new approach to tackle communication constraints in a distributed learning problem with a central server. We propose and analyze a new algorithm that performs bidirectional compression and achieves the same convergence rate as algorithms using only uplink (from the local workers to the central server) compression. To obtain this improvement, we design MCM, an algorithm such that the downlink compression only impacts local models, while the global model is preserved. As a result, and contrary to previous works, the gradients on the local workers are computed on perturbed models. Consequently, convergence proofs are more challenging and require precise control of this perturbation. To ensure this, MCM additionally combines model compression with a memory mechanism. This analysis opens new doors.
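To make the preserved-central-model idea concrete, here is a minimal single-process sketch of one possible MCM-style loop: the server keeps its model w exact, broadcasts only a compressed correction to a shared memory H, workers take gradients at the resulting perturbed model, and the memory update damps the perturbation over time. The random-k compressor, the constants, and the toy least-squares workers are illustrative assumptions of this sketch, not the paper's reference implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

def rand_k(x, k):
    """Unbiased random-k sparsification: keep k coordinates, rescale by d/k."""
    d = x.size
    idx = rng.choice(d, size=k, replace=False)
    out = np.zeros(d)
    out[idx] = (d / k) * x[idx]
    return out

def grad(w, A, b):
    """Gradient of the local least-squares loss ||Aw - b||^2 / (2n)."""
    return A.T @ (A @ w - b) / len(b)

d, k, lr, alpha = 20, 4, 0.1, 0.5              # illustrative constants
workers = [(rng.normal(size=(30, d)), rng.normal(size=30)) for _ in range(4)]
w = np.zeros(d)                                # global model, kept exact on the server
H = np.zeros(d)                                # downlink memory shared with the workers

for step in range(300):
    # Downlink: broadcast only a compressed correction to the memory,
    # leaving the server's own copy of w untouched by compression.
    delta = rand_k(w - H, k)
    w_hat = H + delta                          # perturbed model seen by the workers
    # Uplink: workers compress gradients taken at the perturbed model.
    g = np.mean([rand_k(grad(w_hat, A, b), k) for A, b in workers], axis=0)
    w -= lr * g                                # exact update of the preserved model
    H += alpha * delta                         # memory damps the perturbation

loss = np.mean([np.linalg.norm(A @ w - b)**2 / (2 * len(b)) for A, b in workers])
print("final average loss:", loss)
```

Note the asymmetry with naive bidirectional schemes: only w_hat, never w itself, is touched by downlink compression, which is what allows the rate to match uplink-only compression.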
Lower Bounds and Nearly Optimal Algorithms in Distributed Learning with Communication Compression
Huang, Xinmeng, Chen, Yiming, Yin, Wotao, Yuan, Kun
Recent advances in distributed optimization and learning have shown that communication compression is one of the most effective means of reducing communication. While there have been many results on convergence rates under communication compression, a theoretical lower bound is still missing. Analyses of algorithms with communication compression have attributed convergence to two abstract properties: the unbiased property or the contractive property. They can be applied with either unidirectional compression (only messages from workers to server are compressed) or bidirectional compression. In this paper, we consider distributed stochastic algorithms for minimizing smooth and non-convex objective functions under communication compression. We establish a convergence lower bound for algorithms using either unbiased or contractive compressors, in either the unidirectional or the bidirectional setting. To close the gap between this lower bound and the existing upper bounds, we further propose an algorithm, NEOLITHIC, which almost reaches our lower bound (up to logarithmic factors) under mild conditions. Our results also show that using contractive bidirectional compression can yield iterative methods that converge as fast as those using unbiased unidirectional compression. Experimental results validate our findings.
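The two abstract properties referenced above are usually stated as: an unbiased compressor satisfies E[C(x)] = x with E||C(x) - x||^2 <= omega * ||x||^2, while a contractive compressor satisfies E||C(x) - x||^2 <= (1 - delta) * ||x||^2 for some delta in (0, 1]. As an illustration with standard textbook examples (not code from the paper), random-k sparsification with rescaling is unbiased with omega = d/k - 1, and top-k selection is contractive with delta = k/d:

```python
import numpy as np

rng = np.random.default_rng(1)

def rand_k(x, k):
    """Unbiased: E[C(x)] = x, and E||C(x) - x||^2 = (d/k - 1) ||x||^2."""
    d = x.size
    idx = rng.choice(d, size=k, replace=False)
    out = np.zeros(d)
    out[idx] = (d / k) * x[idx]
    return out

def top_k(x, k):
    """Contractive (but biased): ||C(x) - x||^2 <= (1 - k/d) ||x||^2."""
    out = np.zeros(x.size)
    idx = np.argsort(np.abs(x))[-k:]           # k largest-magnitude coordinates
    out[idx] = x[idx]
    return out

d, k, trials = 50, 5, 20000
x = rng.normal(size=d)

samples = np.stack([rand_k(x, k) for _ in range(trials)])
print("rand-k bias ||E[C(x)] - x||:", np.linalg.norm(samples.mean(axis=0) - x))  # ~ 0
print("rand-k relative variance:",                                               # <= d/k - 1 = 9
      np.mean(np.sum((samples - x) ** 2, axis=1)) / np.sum(x ** 2))
print("top-k relative error:",                                                   # <= 1 - k/d = 0.9
      np.sum((top_k(x, k) - x) ** 2) / np.sum(x ** 2))
```

Top-k is biased (it systematically drops small coordinates), which is why the two properties have led to separate analyses, and why a single lower bound covering both settings is informative.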
Artemis: tight convergence guarantees for bidirectional compression in Federated Learning
Philippenko, Constantin, Dieuleveut, Aymeric
We introduce a new algorithm, Artemis, tackling the problem of learning in a distributed framework with communication constraints. Several (randomly sampled) workers perform the optimization process using a central server to aggregate their computations. To alleviate the communication cost, Artemis compresses the information sent in both directions (from the workers to the server and conversely), combined with a memory mechanism. It improves on existing quantized federated learning algorithms, which only consider unidirectional compression (to the server), or use very strong assumptions on the compression operator, and often do not take into account partial participation of devices. We provide fast rates of convergence (linear up to a threshold) under weak assumptions on the stochastic gradients (the noise variance is bounded only at the optimal point) in the non-i.i.d. setting, highlight the impact of memory for unidirectional and bidirectional compression, and analyze Polyak-Ruppert averaging. We use convergence in distribution to obtain a lower bound on the asymptotic variance that highlights the practical limits of compression. Finally, we provide experimental results to demonstrate the validity of our analysis.
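Below is a minimal sketch of the kind of memory mechanism Artemis relies on, under assumptions of my own (random-k compression in both directions, full participation, a toy least-squares objective): each worker compresses the difference between its gradient and a memory term h_i rather than the gradient itself, the server reconstructs h_i + C(g_i - h_i), and both sides apply the same memory update.

```python
import numpy as np

rng = np.random.default_rng(2)

def rand_k(x, k):
    """Unbiased random-k sparsification, used here for both directions."""
    d = x.size
    idx = rng.choice(d, size=k, replace=False)
    out = np.zeros(d)
    out[idx] = (d / k) * x[idx]
    return out

def grad(w, A, b):
    """Gradient of the local least-squares loss ||Aw - b||^2 / (2n)."""
    return A.T @ (A @ w - b) / len(b)

d, k, lr, alpha = 20, 4, 0.05, 0.25            # illustrative constants
workers = [(rng.normal(size=(30, d)), rng.normal(size=30)) for _ in range(4)]
h = [np.zeros(d) for _ in workers]             # per-worker uplink memories
w = np.zeros(d)

for step in range(500):
    # Uplink: each worker compresses the *difference* to its memory term,
    # so the compressed quantity shrinks as gradients stabilize.
    deltas = [rand_k(grad(w, A, b) - h_i, k) for (A, b), h_i in zip(workers, h)]
    g_hat = np.mean([h_i + d_i for h_i, d_i in zip(h, deltas)], axis=0)
    for h_i, d_i in zip(h, deltas):
        h_i += alpha * d_i                     # identical update on worker and server
    # Downlink: every node applies the same compressed model update.
    w = w - lr * rand_k(g_hat, k)

loss = np.mean([np.linalg.norm(A @ w - b)**2 / (2 * len(b)) for A, b in workers])
print("final average loss:", loss)
```

Compressing differences to a memory, rather than raw gradients, is what makes this kind of scheme workable in the non-i.i.d. setting: as the iterates approach the optimum, the compressed differences shrink, so the compression error shrinks with them.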