Towards Property-Based Classification of Clustering Paradigms

Ackerman, Margareta, Ben-David, Shai, Loker, David

Neural Information Processing Systems 

Clustering is a basic data mining task with a wide variety of applications. Not surprisingly, there exist many clustering algorithms. However, clustering is an ill defined problem - given a data set, it is not clear what a "correct" clustering for that set is. Indeed, different algorithms may yield dramatically different outputs for the same input sets. Faced with a concrete clustering task, a user needs to choose an appropriate clustering algorithm.