Data Clustering and Similarity
Soler, Julien (Virtualys, Université Européenne de Bretagne) | Tencé, Fabien (Virtualys) | Gaubert, Laurent (Université Européenne de Bretagne) | Buche, Cédric (Université européenne de Bretagne)
In this article, we study the notion of similarity within the context of cluster analysis. We begin by studying different distances commonly used for this task and highlight certain important properties that they might have, such as the use of data distribution or reduced sensitivity to the curse of dimensionality. Then we study inter- and intra-cluster similarities. We identify how the choices made can influence the nature of the clusters.
- Technology: