CSER: Communication-efficient SGD with Error Reset

Dec-24-2025, 07:33:41 GMT–Neural Information Processing Systems

The scalability of Distributed Stochastic Gradient Descent (SGD) is today limited by communication bottlenecks.

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.62)