Sparse PCA via Bipartite Matchings

Mar-12-2024, 21:57:30 GMT–Neural Information Processing Systems

We consider the following multi-component sparse PCA problem: given a set of data points, we seek to extract a small number of sparse components with disjoint supports that jointly capture the maximum possible variance. Such components can be computed one by one, repeatedly solving the single-component problem and deflating the input data matrix, but this greedy procedure is suboptimal. We present a novel algorithm for sparse PCA that jointly optimizes multiple disjoint components. The extracted features capture variance that lies within a multiplicative factor arbitrarily close to 1 from the optimal. Our algorithm is combinatorial and computes the desired components by solving multiple instances of the bipartite maximum weight matching problem. Its complexity grows as a low order polynomial in the ambient dimension of the input data, but exponentially in its rank. However, it can be effectively applied on a low-dimensional sketch of the input data. We evaluate our algorithm on real datasets and empirically demonstrate that in many cases it outperforms existing, deflation-based approaches.

algorithm, pca problem, sparse pca problem, (14 more...)

Neural Information Processing Systems

Mar-12-2024, 21:57:30 GMT

Conferences PDF

Add feedback

Country:
- North America > United States
  - Texas > Travis County
    - Austin (0.04)
  - New York > New York County
    - New York City (0.04)
  - California
    - Santa Clara County > Palo Alto (0.04)
    - Alameda County > Berkeley (0.04)
- Asia > Middle East
  - Jordan (0.04)

Industry:
- Government > Regional Government (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Representation & Reasoning > Search (0.66)