Distributed Low-Communication Training with Decoupled Momentum Optimization

Open in new window