Cluster Catch Digraphs with the Nearest Neighbor Distance
Shi, Rui, Billor, Nedret, Ceyhan, Elvan
We introduce a new method for clustering based on Cluster Catch Digraphs (CCDs). The new method addresses the limitations of RK-CCDs by employing a new variant of spatial randomness test that employs the nearest neighbor distance (NND) instead of the Ripley's K function used by RK-CCDs. We conduct a comprehensive Monte Carlo analysis to assess the performance of our method, considering factors such as dimensionality, data set size, number of clusters, cluster volumes, and inter-cluster distance. Our method is particularly effective for high-dimensional data sets, comparable to or outperforming KS-CCDs and RK-CCDs that rely on a KS-type statistic or the Ripley's K function. We also evaluate our methods using real and complex data sets, comparing them to well-known clustering methods. Again, our methods exhibit competitive performance, producing high-quality clusters with desirable properties. Keywords: Graph-based clustering, Cluster catch digraphs, High-dimensional data, The nearest neighbor distance, Spatial randomness test
Jan-9-2025
- Country:
- Asia
- China (0.04)
- Middle East
- Israel > Jerusalem District
- Jerusalem (0.04)
- Jordan (0.04)
- Israel > Jerusalem District
- Europe > Germany (0.04)
- North America > United States
- California (0.04)
- New Jersey > Hudson County
- Hoboken (0.04)
- New York
- Bronx County > New York City (0.04)
- Kings County > New York City (0.04)
- New York County > New York City (0.04)
- Queens County > New York City (0.04)
- Richmond County > New York City (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Asia
- Genre:
- Research Report (0.82)
- Technology: