On Communication Cost of Distributed Statistical Estimation and Dimensionality
Ankit Garg, Tengyu Ma, Huy Nguyen
Neural Information Processing Systems
We explore the connection between dimensionality and communication cost in distributed learning problems. Specifically, we study the problem of estimating the mean θ of an unknown d-dimensional Gaussian distribution in the distributed setting. In this problem, the samples from the unknown distribution are distributed among m different machines. The goal is to estimate the mean θ at the optimal minimax rate while communicating as few bits as possible. We show that in this setting, the communication cost scales linearly in the number of dimensions, i.e., one needs to deal with different dimensions individually.
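To make the setting concrete, the following is a minimal sketch of a naive baseline, not the protocol or the lower-bound argument from the paper. It assumes identity covariance, a mean lying in [-1, 1]^d, and a simple uniform quantizer; the function name `simulate_naive_protocol` and the parameters `bits_per_coord` and `clip` are hypothetical choices made for illustration. Each of the m machines sends its local sample mean, quantized coordinate by coordinate, so the total communication is m · d · bits_per_coord bits, which grows linearly with d.

```python
import numpy as np


def simulate_naive_protocol(d, m, n_per_machine, bits_per_coord=16, clip=4.0, seed=0):
    """Simulate m machines that each send a quantized local mean to a coordinator.

    Assumptions (not from the paper): identity covariance, true mean drawn
    from [-1, 1]^d, and a uniform quantizer on [-clip, clip].
    Returns (total bits communicated, squared error of the final estimate).
    """
    rng = np.random.default_rng(seed)
    theta = rng.uniform(-1.0, 1.0, size=d)  # unknown mean to be estimated

    levels = 2 ** bits_per_coord - 1        # number of quantization intervals
    step = 2.0 * clip / levels              # width of one interval

    messages = []
    for _ in range(m):
        # Each machine draws its own samples from N(theta, I_d).
        samples = theta + rng.standard_normal((n_per_machine, d))
        local_mean = samples.mean(axis=0)

        # Quantize every coordinate independently to bits_per_coord bits.
        clipped = np.clip(local_mean, -clip, clip)
        codes = np.round((clipped + clip) / step)
        messages.append(codes * step - clip)  # value the coordinator decodes

    # The coordinator averages the decoded local means.
    estimate = np.mean(messages, axis=0)

    total_bits = m * d * bits_per_coord
    sq_error = float(np.sum((estimate - theta) ** 2))
    return total_bits, sq_error


if __name__ == "__main__":
    for d in (4, 8, 16):
        bits, err = simulate_naive_protocol(d=d, m=10, n_per_machine=100)
        print(f"d={d:3d}  bits={bits:6d}  squared error={err:.5f}")
```

Doubling d doubles the bit count of this baseline because every coordinate is encoded separately. The abstract's result says that, at the optimal minimax rate, this linear dependence on the dimension cannot be avoided.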
Feb-9-2025, 02:10:28 GMT