Robust Sub-Gaussian Principal Component Analysis and Width-Independent Schatten Packing
Jambulapati, Arun, Li, Jerry, Tian, Kevin
We develop two methods for the following fundamental statistical task: given an $\epsilon$-corrupted set of $n$ samples from a $d$-dimensional sub-Gaussian distribution, return an approximate top eigenvector of the covariance matrix. Our first robust PCA algorithm runs in polynomial time, returns a $1 - O(\epsilon\log\epsilon^{-1})$-approximate top eigenvector, and is based on a simple iterative filtering approach. Our second, which attains a slightly worse approximation factor, runs in nearly-linear time and sample complexity under a mild spectral gap assumption. These are the first polynomial-time algorithms yielding non-trivial information about the covariance of a corrupted sub-Gaussian distribution without requiring additional algebraic structure of moments. As a key technical tool, we develop the first width-independent solvers for Schatten-$p$ norm packing semidefinite programs, giving a $(1 + \epsilon)$-approximate solution in $O(p\log(\tfrac{nd}{\epsilon})\epsilon^{-1})$ input-sparsity time iterations (where $n$, $d$ are problem dimensions).
Jun-12-2020
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America
- United States
- Massachusetts (0.04)
- Virginia > Arlington County
- Arlington (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- California
- Santa Clara County > Palo Alto (0.04)
- San Diego County > San Diego (0.04)
- Riverside County > Palm Springs (0.04)
- Arizona > Maricopa County
- Phoenix (0.04)
- Canada
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- United States
- Europe
- Asia > Afghanistan
- Parwan Province > Charikar (0.04)
- Oceania > Australia
- Genre:
- Research Report (0.50)