Coordinating Distributed Example Orders for Provably Accelerated Training
Neural Information Processing Systems
Whereas random reshuffling (RR) arbitrarily permutes training examples each epoch, GraB (gradient balancing) leverages stale gradients from prior epochs to order examples, achieving a provably faster convergence rate than RR.
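To make the contrast concrete, here is a minimal single-worker sketch of the two ordering strategies. It is an illustration under stated assumptions, not the paper's distributed algorithm: the function names `random_reshuffle` and `balance_reorder` are hypothetical, per-example gradients are assumed to fit in one array, and the stale mean is taken over that whole array rather than tracked online.

```python
import numpy as np


def random_reshuffle(n, rng):
    """RR baseline: draw a fresh uniformly random permutation of n examples."""
    return rng.permutation(n)


def balance_reorder(order, stale_grads):
    """Build the next epoch's order from stale gradients by sign balancing.

    order:       the current epoch's example indices (a permutation of 0..n-1)
    stale_grads: array-like of shape (n, d); row i is the gradient recorded
                 for example i during an earlier epoch (hypothetical layout)
    """
    grads = np.asarray(stale_grads, dtype=float)
    mean = grads.mean(axis=0)          # center gradients with the stale mean
    running = np.zeros_like(mean)      # signed running sum of centered gradients
    front, back = [], []
    for i in order:
        c = grads[i] - mean
        # Pick the sign that keeps the running sum smaller in norm.
        if np.linalg.norm(running + c) <= np.linalg.norm(running - c):
            running += c
            front.append(int(i))       # +1: visit early in the next epoch
        else:
            running -= c
            back.append(int(i))        # -1: visit late, in reversed order
    return front + back[::-1]
```

With `rng = np.random.default_rng(0)`, calling `random_reshuffle(8, rng)` gives an RR order, and passing that order plus an `(8, d)` gradient array to `balance_reorder` returns a new permutation whose prefix sums of centered gradients tend to stay small, which is the intuition behind ordering by gradient balancing.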
Feb-16-2026, 13:52:18 GMT
- Country:
- Europe > Belgium
- Brussels-Capital Region > Brussels (0.04)
- North America > United States (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Technology: