Efficient List-Decodable Regression using Batches

Das, Abhimanyu, Jain, Ayush, Kong, Weihao, Sen, Rajat

Nov-23-2022–arXiv.org Artificial Intelligence

We begin the study of list-decodable linear regression using batches. In this setting only an $\alpha \in (0,1]$ fraction of the batches are genuine. Each genuine batch contains $\ge n$ i.i.d. samples from a common unknown distribution and the remaining batches may contain arbitrary or even adversarial samples. We derive a polynomial time algorithm that for any $n\ge \tilde \Omega(1/\alpha)$ returns a list of size $\mathcal O(1/\alpha^2)$ such that one of the items in the list is close to the true regression parameter. The algorithm requires only $\tilde{\mathcal{O}}(d/\alpha^2)$ genuine batches and works under fairly general assumptions on the distribution. The results demonstrate the utility of batch structure, which allows for the first polynomial time algorithm for list-decodable regression, which may be impossible for the non-batch setting, as suggested by a recent SQ lower bound \cite{diakonikolas2021statistical} for the non-batch setting.

artificial intelligence, batch, machine learning, (15 more...)

arXiv.org Artificial Intelligence

Nov-23-2022

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Maryland > Baltimore (0.04)
  - District of Columbia > Washington (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
  - California
    - San Diego County > San Diego (0.04)
    - Monterey County > Pacific Grove (0.04)
- Europe
  - Germany (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - Afghanistan > Parwan Province
    - Charikar (0.04)

Genre:
- Research Report > New Finding (0.65)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (0.92)
  - Machine Learning > Statistical Learning
    - Regression (0.66)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found