Enabling clustering algorithms to detect clusters of varying densities through scale-invariant data preprocessing

Aryal, Sunil, Wells, Jonathan R., Baniya, Arbind Agrahari, Santosh, KC

Jan-20-2024–arXiv.org Artificial Intelligence

In this paper, we show that preprocessing data using a variant of rank transformation called 'Average Rank over an Ensemble of Sub-samples (ARES)' makes clustering algorithms robust to data representation and enable them to detect varying density clusters. Our empirical results, obtained using three most widely used clustering algorithms-namely KMeans, DBSCAN, and DP (Density Peak)-across a wide range of real-world datasets, show that clustering after ARES transformation produces better and more consistent results.

algorithm, representation, transformation, (15 more...)

arXiv.org Artificial Intelligence

Jan-20-2024

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia (0.05)
- Asia (0.04)
- North America > United States
  - District of Columbia > Washington (0.04)
  - South Dakota > Clay County
    - Vermillion (0.04)
  - New York > New York County
    - New York City (0.04)

Genre:
- Research Report (1.00)

Technology:
- Information Technology
  - Data Science > Data Mining (1.00)
  - Artificial Intelligence > Machine Learning
    - Statistical Learning > Clustering (1.00)