Communication-efficient SGD: From Local SGD to One-Shot Averaging
–Neural Information Processing Systems
This method requires each worker to share their computed gradients with each other at every iteration. We will refer to this method as "synchronized parallel SGD."
Neural Information Processing Systems
Nov-15-2025, 16:59:06 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- North America > United States
- Massachusetts > Suffolk County > Boston (0.04)
- Asia > Middle East
- Genre:
- Research Report > New Finding (0.94)
- Technology: