BML: A High-performance, Low-cost Gradient Synchronization Algorithm for DML Training
Songtao Wang, Dan Li, Yang Cheng, Jinkun Geng, Yanshu Wang, Shuai Wang, Shu-Tao Xia, Jianping Wu
Neural Information Processing Systems
In distributed machine learning (DML), the network performance between machines significantly impacts the speed of iterative training. In this paper, we propose BML, a new gradient synchronization algorithm with higher network performance and lower network cost than current practice. BML runs on a BCube network, instead of the traditional Fat-Tree topology.
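The abstract does not spell out BML's synchronization scheme, but the generic step it optimizes is synchronous gradient aggregation: each worker computes a local gradient, and the workers must agree on the element-wise average before the next iteration. The sketch below illustrates that baseline step only; it is a hypothetical, minimal illustration, not the BML algorithm or its BCube-aware communication pattern.

```python
def average_gradients(worker_grads):
    """Element-wise average of per-worker gradient vectors.

    This is the logical result every synchronous gradient-synchronization
    scheme (parameter server, ring all-reduce, BML, ...) must produce;
    the schemes differ in how traffic is routed, not in this math.
    """
    n_workers = len(worker_grads)
    dim = len(worker_grads[0])
    return [sum(g[i] for g in worker_grads) / n_workers for i in range(dim)]

# Example: 3 workers, each holding a 4-dimensional local gradient.
grads = [
    [1.0, 2.0, 3.0, 4.0],
    [2.0, 4.0, 6.0, 8.0],
    [3.0, 6.0, 9.0, 12.0],
]
print(average_gradients(grads))  # [2.0, 4.0, 6.0, 8.0]
```

The interesting part of algorithms like BML is realizing this average with less traffic over cheaper network hardware; the arithmetic itself is fixed.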