Robust Learning of Discrete Distributions from Batches
Let $d$ be the lowest $L_1$ distance to which a $k$-symbol distribution $p$ can be estimated from $m$ batches of $n$ samples each, when up to $\beta m$ batches may be adversarial. For $\beta<1/2$, Qiao and Valiant (2017) showed that $d=\Omega(\beta/\sqrt{n})$ and requires $m=\Omega(k/\beta^2)$ batches. For $\beta<1/900$, they provided a $d$ and $m$ order-optimal algorithm that runs in time exponential in $k$. For $\beta<0.5$, we propose an algorithm with comparably optimal $d$ and $m$, but run-time polynomial in $k$ and all other parameters.
Nov-19-2019
- Country:
- North America > United States
- Massachusetts > Middlesex County
- Cambridge (0.04)
- California > San Diego County
- San Diego (0.04)
- Massachusetts > Middlesex County
- Asia > Afghanistan
- Parwan Province > Charikar (0.04)
- North America > United States
- Genre:
- Research Report (0.50)
- Technology: