Sharper Convergence Guarantees for Asynchronous SGD for Distributed and Federated Learning

Neural Information Processing Systems 

We study the asynchronous stochastic gradient descent algorithm for distributed training over n workers whose computation and communication speeds vary over time. In this algorithm, workers compute stochastic gradients in parallel at their own pace and return them to the server without any synchronization.
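
As a rough illustration of this update pattern, the sketch below simulates asynchronous SGD on a toy least-squares problem: each worker computes a gradient at a possibly stale copy of the model, and the server applies it as soon as it arrives, without waiting for the other workers. The problem setup, worker speeds, and step size here are illustrative assumptions, not the paper's algorithm specification or experimental protocol.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy objective: f(x) = 1/(2m) * ||A x - b||^2  (assumed for illustration)
m, d = 200, 10
A = rng.normal(size=(m, d))
x_star = rng.normal(size=d)
b = A @ x_star + 0.1 * rng.normal(size=m)

def stochastic_grad(x, batch=16):
    """Mini-batch gradient of the least-squares loss at x."""
    idx = rng.integers(0, m, size=batch)
    return A[idx].T @ (A[idx] @ x - b[idx]) / batch

n_workers = 4      # hypothetical number of workers
lr = 0.05          # hypothetical step size
T = 500            # number of server updates

x = np.zeros(d)    # current server model
# Each worker holds the (possibly stale) model copy it last received
# and finishes its current gradient computation at a random time,
# modeling heterogeneous computation speeds.
worker_model = [x.copy() for _ in range(n_workers)]
finish_time = rng.integers(1, 8, size=n_workers)

t, clock = 0, 0
while t < T:
    clock += 1
    for i in range(n_workers):
        if clock >= finish_time[i]:
            # Worker i returns a gradient computed at its stale copy;
            # the server applies it immediately, with no synchronization.
            g = stochastic_grad(worker_model[i])
            x -= lr * g
            t += 1
            # Worker i then picks up the current model and starts a new job.
            worker_model[i] = x.copy()
            finish_time[i] = clock + rng.integers(1, 8)

print("final loss:", 0.5 * np.mean((A @ x - b) ** 2))
```

Because the server never waits, the gradient applied at each step may be evaluated at an outdated model; the delay between when a worker reads the model and when its gradient is applied is the staleness that the convergence analysis must account for.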