61162d94822d468ee6e92803340f2040-Supplemental-Conference.pdf
–Neural Information Processing Systems
In data-parallel optimization of machine learning models, workers collaborate to improve their estimates of the model: more accurate gradients allow them to use larger learning rates and optimize faster. We consider the setting in which allworkerssample fromthesamedataset, andcommunicate overasparsegraph (decentralized).
Neural Information Processing Systems
Feb-9-2026, 09:06:57 GMT
- Technology: