GradiVeQ: Vector Quantization for Bandwidth-Efficient Gradient Aggregation in Distributed CNN Training
Mingchao Yu, Zhifeng Lin, Krishna Narra, Songze Li, Youjie Li, Nam Sung Kim, Alexander Schwing, Murali Annavaram, Salman Avestimehr
–Neural Information Processing Systems
Data parallelism can boost the training speed of convolutional neural networks (CNN), but could suffer from significant communication costs caused by gradient aggregation.
Neural Information Processing Systems
Nov-20-2025, 20:11:32 GMT
- Country:
- North America
- Canada
- United States
- California (0.14)
- Illinois (0.04)
- North America
- Industry:
- Government > Regional Government (0.46)
- Technology: