AITopics | ef21

Collaborating Authors

ef21

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

6fb9ea5197c0b8ece8a64220fb82cdfe-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-9-2026, 17:15:09 GMT

algorithm, compressor, ef-bv, (15 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Saudi Arabia (0.04)

Genre: Research Report (0.46)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Data Science (0.93)

Add feedback

Appendix

Neural Information Processing SystemsFeb-7-2026, 20:54:42 GMT

I.1 From unbiased to biased compressors. . . . . . . . . . . . . . . . . . . . . . . .

artificial intelligence, compressor, machine learning, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.46)

Add feedback

EF21: ANew, Simpler, TheoreticallyBetter, andPracticallyFasterErrorFeedback

Neural Information Processing SystemsFeb-7-2026, 20:54:38 GMT

Moreover,ourtheoretical analysis reliesonstandard assumptions only,works inthedistributed heterogeneous data setting, and leads tobetter and more meaningful rates.

artificial intelligence, arxivpreprintarxiv, machine learning, (19 more...)

Neural Information Processing Systems

Country: North America > Canada > Ontario > Toronto (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)

Add feedback

EF21: A New, Simpler, Theoretically Better, and Practically Faster Error Feedback

Neural Information Processing SystemsDec-23-2025, 21:18:50 GMT

Error feedback (EF), also known as error compensation, is an immensely popular convergence stabilization mechanism in the context of distributed training of supervised machine learning models enhanced by the use of contractive communication compression mechanisms, such as Top-$k$. First proposed by Seide et al [2014] as a heuristic, EF resisted any theoretical understanding until recently [Stich et al., 2018, Alistarh et al., 2018]. While these early breakthroughs were followed by a steady stream of works offering various improvements and generalizations, the current theoretical understanding of EF is still very limited. Indeed, to the best of our knowledge, all existing analyses either i) apply to the single node setting only, ii) rely on very strong and often unreasonable assumptions, such as global boundedness of the gradients, or iterate-dependent assumptions that cannot be checked a-priori and may not hold in practice, or iii) circumvent these issues via the introduction of additional unbiased compressors, which increase the communication cost. In this work we fix all these deficiencies by proposing and analyzing a new EF mechanism, which we call EF21, which consistently and substantially outperforms EF in practice. Moreover, our theoretical analysis relies on standard assumptions only, works in the distributed heterogeneous data setting, and leads to better and more meaningful rates. In particular, we prove that EF21 enjoys a fast $\mathcal{O}(1/T)$ convergence rate for smooth nonconvex problems, beating the previous bound of $\mathcal{O}(1/T^{2/3})$, which was shown under a strong bounded gradients assumption. We further improve this to a fast linear rate for Polyak-Lojasiewicz functions, which is the first linear convergence result for an error feedback method not relying on unbiased compressors. Since EF has a large number of applications where it reigns supreme, we believe that our 2021 variant, EF21, will have a large impact on the practice of communication efficient distributed learning.

ef21, faster error feedback, name change, (9 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.74)

Add feedback

231141b34c82aa95e48810a9d1b33a79-Supplemental.pdf

Neural Information Processing SystemsOct-2-2025, 21:37:45 GMT

First, in Section A.1 we comment on experiments

artificial intelligence, machine learning, stepsize, (19 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

EF21: A New, Simpler, Theoretically Better, and Practically Faster Error Feedback

Neural Information Processing SystemsOct-2-2025, 21:37:42 GMT

This paper was written while Ilyas Fatkhullin was an intern at KAUST.

artificial intelligence, compressor, machine learning, (14 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.14)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
Asia > Middle East > Saudi Arabia > Mecca Province > Thuwal (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.72)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)

Add feedback

6fb9ea5197c0b8ece8a64220fb82cdfe-Supplemental-Conference.pdf

Neural Information Processing SystemsAug-15-2025, 17:39:46 GMT

artificial intelligence, compressor, machine learning, (16 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Saudi Arabia (0.04)

Genre: Research Report (0.46)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Data Science (0.93)

Add feedback

EF-BV: A Unified Theory of Error Feedback and Variance Reduction Mechanisms for Biased and Unbiased Compression in Distributed Optimization

Neural Information Processing SystemsAug-15-2025, 17:39:42 GMT

In the case of biased and contractive compressors (e.g., top-k), the

artificial intelligence, compressor, machine learning, (15 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Saudi Arabia (0.04)

Industry: Information Technology (0.46)

Technology:

Information Technology > Data Science (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

Add feedback

EF21 with Bells & Whistles: Six Algorithmic Extensions of Modern Error Feedback

Fatkhullin, Ilyas, Sokolov, Igor, Gorbunov, Eduard, Li, Zhize, Richtárik, Peter

arXiv.org Artificial IntelligenceJun-23-2025

First proposed by Seide (2014) as a heuristic, error feedback (EF) is a very popular mechanism for enforcing convergence of distributed gradient-based optimization methods enhanced with communication compression strategies based on the application of contractive compression operators. However, existing theory of EF relies on very strong assumptions (e.g., bounded gradients), and provides pessimistic convergence rates (e.g., while the best known rate for EF in the smooth nonconvex regime, and when full gradients are compressed, is $O(1/T^{2/3})$, the rate of gradient descent in the same regime is $O(1/T)$). Recently, Richtárik et al. (2021) proposed a new error feedback mechanism, EF21, based on the construction of a Markov compressor induced by a contractive compressor. EF21 removes the aforementioned theoretical deficiencies of EF and at the same time works better in practice. In this work we propose six practical extensions of EF21, all supported by strong convergence theory: partial participation, stochastic approximation, variance reduction, proximal setting, momentum, and bidirectional compression. To the best of our knowledge, several of these techniques have not been previously analyzed in combination with EF, and in cases where prior analysis exists -- such as for bidirectional compression -- our theoretical convergence guarantees significantly improve upon existing results.

artificial intelligence, deep learning, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2110.03294

Country:

Asia > Russia (0.28)
North America > Canada > Ontario > Toronto (0.14)
Asia > Middle East > Saudi Arabia (0.04)
(6 more...)

Genre: Research Report > New Finding (0.92)

Industry: Information Technology (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.35)

Add feedback

Error Feedback under $(L_0,L_1)$-Smoothness: Normalization and Momentum

Khirirat, Sarit, Sadiev, Abdurakhmon, Riabinin, Artem, Gorbunov, Eduard, Richtárik, Peter

arXiv.org Artificial IntelligenceOct-22-2024

We provide the first proof of convergence for normalized error feedback algorithms across a wide range of machine learning problems. Despite their popularity and efficiency in training deep neural networks, traditional analyses of error feedback algorithms rely on the smoothness assumption that does not capture the properties of objective functions in these problems. Rather, these problems have recently been shown to satisfy generalized smoothness assumptions, and the theoretical understanding of error feedback algorithms under these assumptions remains largely unexplored. Moreover, to the best of our knowledge, all existing analyses under generalized smoothness either i) focus on single-node settings or ii) make unrealistically strong assumptions for distributed settings, such as requiring data heterogeneity, and almost surely bounded stochastic gradient noise variance. In this paper, we propose distributed error feedback algorithms that utilize normalization to achieve the $O(1/\sqrt{K})$ convergence rate for nonconvex problems under generalized smoothness. Our analyses apply for distributed settings without data heterogeneity conditions, and enable stepsize tuning that is independent of problem parameters. Additionally, we provide strong convergence guarantees of normalized error feedback algorithms for stochastic settings. Finally, we show that due to their larger allowable stepsizes, our new normalized error feedback algorithms outperform their non-normalized counterparts on various tasks, including the minimization of polynomial functions, logistic regression, and ResNet-20 training.

artificial intelligence, exp, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2410.16871

Country: