Figure 1: 11 easy. Figure 2: Empirical sync. time.

Neural Information Processing Systems 

General response: We thank all reviewers for their comments. RL achieve a 3.55 ± 0.3 and 1.50 ± 0.7 score difference, respectively. The training curve is shown in Figure 1. In Claim 1, we assume the step times follow an exponential distribution. Hence, the sum of step times (the synchronization time) follows a Gamma distribution.
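
As a quick sanity check of the Gamma argument, the following sketch simulates the synchronization time as a sum of exponential step times and compares its empirical mean and variance with those of a Gamma distribution. It assumes the step times are independent and share a single rate; the names `num_steps`, `rate`, and `num_trials` and their values are illustrative and are not taken from the paper.

```python
import random
import statistics


def simulate_sync_times(num_steps, rate, num_trials, seed=0):
    """Draw `num_trials` synchronization times, each the sum of `num_steps`
    independent exponential step times with a common rate (assumption)."""
    rng = random.Random(seed)
    return [
        sum(rng.expovariate(rate) for _ in range(num_steps))
        for _ in range(num_trials)
    ]


def gamma_moments(num_steps, rate):
    """Mean and variance of Gamma(shape=num_steps, rate=rate)."""
    return num_steps / rate, num_steps / rate ** 2


# Hypothetical values chosen only for illustration.
samples = simulate_sync_times(num_steps=8, rate=2.0, num_trials=20000)
empirical_mean = statistics.fmean(samples)
empirical_var = statistics.variance(samples)
theory_mean, theory_var = gamma_moments(num_steps=8, rate=2.0)

print(f"mean: empirical {empirical_mean:.3f}, Gamma {theory_mean:.3f}")
print(f"variance: empirical {empirical_var:.3f}, Gamma {theory_var:.3f}")
```

If the step times had different rates, the sum would generally no longer be Gamma distributed, so the shared-rate assumption matters for this check.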