Fuzzy clustering of distribution-valued data using adaptive L2 Wasserstein distances
Irpino, Antonio, De Carvalho, Francisco, Verde, Rosanna
Distributional (or distribution-valued) data are a new type of data arising from several sources and are considered as realizations of distributional variables. A new set of fuzzy c-means algorithms for data described by distributional variables is proposed. The algorithms use the $L2$ Wasserstein distance between distributions as dissimilarity measures. Beside the extension of the fuzzy c-means algorithm for distributional data, and considering a decomposition of the squared $L2$ Wasserstein distance, we propose a set of algorithms using different automatic way to compute the weights associated with the variables as well as with their components, globally or cluster-wise. The relevance weights are computed in the clustering process introducing product-to-one constraints. The relevance weights induce adaptive distances expressing the importance of each variable or of each component in the clustering process, acting also as a variable selection method in clustering. We have tested the proposed algorithms on artificial and real-world data. Results confirm that the proposed methods are able to better take into account the cluster structure of the data with respect to the standard fuzzy c-means, with non-adaptive distances.
May-2-2016
- Country:
- Africa
- Ghana (0.04)
- Middle East > Tunisia (0.04)
- Namibia (0.04)
- Saint Helena, Ascension and Tristan da Cunha (0.04)
- Western Sahara (0.04)
- Asia
- Azerbaijan (0.04)
- Kazakhstan (0.04)
- Laos (0.04)
- Middle East > Syria (0.04)
- Nepal (0.04)
- Pakistan (0.04)
- Philippines (0.04)
- Taiwan (0.04)
- Europe
- Iceland (0.04)
- Italy (0.04)
- Liechtenstein (0.04)
- North Macedonia (0.04)
- Norway (0.04)
- Poland (0.04)
- Romania (0.04)
- Slovakia (0.04)
- North America
- Sint Maarten (0.04)
- Cuba (0.04)
- Haiti (0.04)
- The Bahamas (0.04)
- United States
- Michigan (0.04)
- New Jersey > Hudson County
- Hoboken (0.04)
- New York (0.04)
- Oregon (0.04)
- Belize (0.04)
- Honduras (0.04)
- Puerto Rico (0.04)
- Costa Rica (0.04)
- Saint Martin (0.04)
- Panama (0.04)
- Montserrat (0.04)
- Oceania
- Australia (0.04)
- Kiribati (0.04)
- New Zealand (0.04)
- Guam (0.04)
- Solomon Islands (0.04)
- New Caledonia (0.04)
- Papua New Guinea (0.04)
- Palau (0.04)
- French Polynesia (0.04)
- South America > Brazil
- Pernambuco > Recife (0.04)
- Africa
- Genre:
- Research Report (0.70)
- Technology: