Nearly Linear Sparsification of $\ell_p$ Subspace Approximation
Woodruff, David P., Yasuda, Taisuke
–arXiv.org Artificial Intelligence
The $\ell_p$ subspace approximation problem is an NP-hard low rank approximation problem that generalizes the median hyperplane problem ($p = 1$), principal component analysis ($p = 2$), and the center hyperplane problem ($p = \infty$). A popular approach to cope with the NP-hardness of this problem is to compute a strong coreset, which is a small weighted subset of the input points which simultaneously approximates the cost of every $k$-dimensional subspace, typically to $(1+\varepsilon)$ relative error for a small constant $\varepsilon$. We obtain the first algorithm for constructing a strong coreset for $\ell_p$ subspace approximation with a nearly optimal dependence on the rank parameter $k$, obtaining a nearly linear bound of $\tilde O(k)\mathrm{poly}(\varepsilon^{-1})$ for $p<2$ and $\tilde O(k^{p/2})\mathrm{poly}(\varepsilon^{-1})$ for $p>2$. Prior constructions either achieved a similar size bound but produced a coreset with a modification of the original points [SW18, FKW21], or produced a coreset of the original points but lost $\mathrm{poly}(k)$ factors in the coreset size [HV20, WY23]. Our techniques also lead to the first nearly optimal online strong coresets for $\ell_p$ subspace approximation with similar bounds as the offline setting, resolving a problem of [WY23]. All prior approaches lose $\mathrm{poly}(k)$ factors in this setting, even when allowed to modify the original points.
arXiv.org Artificial Intelligence
Jul-3-2024
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America
- United States
- Maryland > Baltimore (0.04)
- Texas > Travis County
- Austin (0.04)
- Florida
- Orange County > Orlando (0.04)
- Miami-Dade County > Miami (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Virginia > Arlington County
- Arlington (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Colorado
- Denver County > Denver (0.04)
- Boulder County > Boulder (0.04)
- Pennsylvania
- Allegheny County > Pittsburgh (0.14)
- Philadelphia County > Philadelphia (0.04)
- North Carolina > Durham County
- Durham (0.04)
- California
- San Francisco County > San Francisco (0.14)
- Alameda County > Berkeley (0.04)
- Santa Cruz County > Santa Cruz (0.04)
- San Diego County > San Diego (0.04)
- Santa Clara County
- Canada
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- United States
- Europe
- Switzerland > Zürich
- Zürich (0.14)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Italy
- Iceland > Capital Region
- Reykjavik (0.04)
- France > Île-de-France
- Switzerland > Zürich
- Asia
- Middle East > Israel
- Jerusalem District > Jerusalem (0.04)
- Japan > Honshū
- Kansai > Kyoto Prefecture > Kyoto (0.04)
- India > Telangana
- Hyderabad (0.04)
- China > Guangdong Province
- Shenzhen (0.04)
- Afghanistan > Parwan Province
- Charikar (0.04)
- Middle East > Israel
- Oceania > Australia
- Genre:
- Research Report (0.50)
- Technology: