OSP: Boosting Distributed Model Training with 2-stage Synchronization