Pipe-SGD: A Decentralized Pipelined SGD Framework for Distributed Deep Net Training
Youjie Li, Mingchao Yu, Songze Li, Salman Avestimehr, Nam Sung Kim, Alexander Schwing
Neural Information Processing Systems
In this paper, we carefully analyze the AllReduce-based setup, propose timing models that include network latency, bandwidth, cluster size, and compute time, and demonstrate that pipelined training with a width of two combines the best of both synchronous and asynchronous training.
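As a rough illustration of how such a timing model can be set up, the sketch below uses the textbook latency-bandwidth (alpha-beta) cost of a ring AllReduce and an idealized steady-state view of pipelining. This is an assumption made for illustration, not the model from the paper: the function names, the ring algorithm, and the simplification that a width-two pipeline fully overlaps communication with the next iteration's compute are all hypothetical.

```python
def ring_allreduce_time(n_bytes, workers, latency_s, bandwidth_bytes_per_s):
    """Textbook alpha-beta cost of a ring AllReduce (assumed, not the paper's model).

    The ring runs 2 * (workers - 1) steps (reduce-scatter, then all-gather);
    each step sends one chunk of n_bytes / workers and pays one latency.
    """
    if workers < 2:
        return 0.0  # a single worker has nothing to communicate
    steps = 2 * (workers - 1)
    chunk_bytes = n_bytes / workers
    return steps * (latency_s + chunk_bytes / bandwidth_bytes_per_s)


def sequential_iteration_time(compute_s, comm_s):
    """Synchronous training without overlap: compute, then communicate."""
    return compute_s + comm_s


def pipelined_iteration_time(compute_s, comm_s):
    """Idealized steady state for a width-two pipeline (assumption):
    communication of one iteration overlaps compute of the next,
    so the slower of the two stages sets the per-iteration time."""
    return max(compute_s, comm_s)


if __name__ == "__main__":
    # Hypothetical numbers: 100 MB of gradients, 4 workers,
    # 0.1 ms latency, 1 GB/s bandwidth, 0.2 s of compute per iteration.
    comm = ring_allreduce_time(100e6, 4, 1e-4, 1e9)
    print("communication:", comm)
    print("sequential:   ", sequential_iteration_time(0.2, comm))
    print("pipelined:    ", pipelined_iteration_time(0.2, comm))
```

Under these assumptions, the pipelined per-iteration time is never worse than the sequential one, and the gain is largest when compute and communication times are comparable.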