Unsupervised Feature Selection for the $k$-means Clustering Problem

Boutsidis, Christos, Drineas, Petros, Mahoney, Michael W.

Dec-31-2009–Neural Information Processing Systems

We present a novel feature selection algorithm for the $k$-means clustering problem. Our algorithm is randomized and, assuming an accuracy parameter $\epsilon \in (0,1)$, selects and appropriately rescales in an unsupervised manner $\Theta(k \log(k / \epsilon) / \epsilon^2)$ features from a dataset of arbitrary dimensions. We prove that, if we run any $\gamma$-approximate $k$-means algorithm ($\gamma \geq 1$) on the features selected using our method, we can find a $(1+(1+\epsilon)\gamma)$-approximate partition with high probability.

algorithm, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Dec-31-2009

Conferences PDF

Add feedback

Country:
- North America > United States (1.00)

Industry:
- Health & Medicine
  - Pharmaceuticals & Biotechnology (0.68)
  - Therapeutic Area > Oncology (0.68)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found