Enabling clustering algorithms to detect clusters of varying densities through scale-invariant data preprocessing
Aryal, Sunil, Wells, Jonathan R., Baniya, Arbind Agrahari, Santosh, KC
–arXiv.org Artificial Intelligence
In this paper, we show that preprocessing data using a variant of rank transformation called 'Average Rank over an Ensemble of Sub-samples (ARES)' makes clustering algorithms robust to data representation and enable them to detect varying density clusters. Our empirical results, obtained using three most widely used clustering algorithms-namely KMeans, DBSCAN, and DP (Density Peak)-across a wide range of real-world datasets, show that clustering after ARES transformation produces better and more consistent results.
arXiv.org Artificial Intelligence
Jan-20-2024
- Country:
- Asia (0.04)
- North America > United States
- District of Columbia > Washington (0.04)
- New York > New York County
- New York City (0.04)
- South Dakota > Clay County
- Vermillion (0.04)
- Oceania > Australia (0.05)
- Genre:
- Research Report (1.00)
- Technology: