Unveiling the Power of Multiple Gossip Steps: A Stability-Based Generalization Analysis in Decentralized Training
