Tuning the Scheduling of Distributed Stochastic Gradient Descent with Bayesian Optimization