Statistical Query Lower Bounds for List-Decodable Linear Regression
Diakonikolas, Ilias, Kane, Daniel M., Pensia, Ankit, Pittas, Thanasis, Stewart, Alistair
We study the problem of list-decodable linear regression, where an adversary can corrupt a majority of the examples. Specifically, we are given a set $T$ of labeled examples $(x, y) \in \mathbb{R}^d \times \mathbb{R}$ and a parameter $0< \alpha <1/2$ such that an $\alpha$-fraction of the points in $T$ are i.i.d. samples from a linear regression model with Gaussian covariates, and the remaining $(1-\alpha)$-fraction of the points are drawn from an arbitrary noise distribution. The goal is to output a small list of hypothesis vectors such that at least one of them is close to the target regression vector. Our main result is a Statistical Query (SQ) lower bound of $d^{\mathrm{poly}(1/\alpha)}$ for this problem. Our SQ lower bound qualitatively matches the performance of previously developed algorithms, providing evidence that current upper bounds for this task are nearly best possible.
Jun-17-2021
- Country:
- North America > United States
- Maryland > Baltimore (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- New York > New York County
- New York City (0.04)
- California
- San Diego County > San Diego (0.04)
- Monterey County > Pacific Grove (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- Asia
- Middle East > Jordan (0.04)
- Afghanistan > Parwan Province
- Charikar (0.04)
- North America > United States
- Genre:
- Research Report (0.82)
- Technology: