Size Regularized Cut for Data Clustering
Chen, Yixin, Zhang, Ya, Ji, Xiang
Neural Information Processing Systems
We present a novel spectral clustering method that enables users to incorporate prior knowledge of the size of clusters into the clustering process. The cost function, which is named size regularized cut (SRcut), is defined as the sum of the inter-cluster similarity and a regularization term measuring the relative size of two clusters. Finding a partition of the data set to minimize SRcut is proved to be NP-complete. An approximation algorithm is proposed to solve a relaxed version of the optimization problem as an eigenvalue problem. Evaluations over different data sets demonstrate that the method is not sensitive to outliers and performs better than normalized cut.
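The idea of adding a size term to the cut can be illustrated on a toy graph. The abstract does not state the exact form of the regularizer, so the sketch below assumes it is the negative product of the two cluster sizes scaled by a hypothetical trade-off parameter `alpha`. It also swaps in an exhaustive search for the eigenvalue relaxation, which is only feasible for a handful of points. The function names and the similarity values are illustrative, not taken from the paper.

```python
from itertools import product

def cut_value(W, labels):
    """Sum of similarities between points placed in different clusters."""
    n = len(W)
    return sum(W[i][j] for i in range(n) for j in range(i + 1, n)
               if labels[i] != labels[j])

def sr_cut(W, labels, alpha):
    """Cut plus an assumed size regularizer: -alpha * |V1| * |V2|."""
    n1 = sum(1 for l in labels if l == 0)
    n2 = len(labels) - n1
    return cut_value(W, labels) - alpha * n1 * n2

def best_bipartition(W, alpha):
    """Exhaustive search over two-way partitions (tiny inputs only)."""
    n = len(W)
    best = None
    for rest in product((0, 1), repeat=n - 1):
        labels = (0,) + rest          # fix point 0 to skip mirrored labelings
        if 1 not in labels:
            continue                  # both clusters must be non-empty
        cost = sr_cut(W, labels, alpha)
        if best is None or cost < best[0]:
            best = (cost, labels)
    return best[1]

# Toy similarities: two tight pairs {0, 1} and {2, 3}, plus a weakly
# connected outlier (point 4) that sits slightly closer to {2, 3}.
edges = {(0, 1): 1.0, (2, 3): 1.0, (0, 2): 0.2, (1, 3): 0.2,
         (0, 3): 0.1, (1, 2): 0.1,
         (0, 4): 0.04, (1, 4): 0.04, (2, 4): 0.06, (3, 4): 0.06}
W = [[0.0] * 5 for _ in range(5)]
for (i, j), w in edges.items():
    W[i][j] = W[j][i] = w

print(best_bipartition(W, alpha=0.0))  # plain minimum cut: (0, 0, 0, 0, 1)
print(best_bipartition(W, alpha=0.3))  # with size term:    (0, 0, 1, 1, 1)
```

With `alpha = 0` the search reduces to a plain minimum cut and splits off the outlier alone. With the assumed size term switched on, the two tight pairs are separated and the outlier joins the nearer pair, which is the kind of outlier insensitivity the abstract describes.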
Dec-31-2006
- Genre:
- Research Report (0.47)
- Technology: