Rocha
Clustering analysis has become a ubiquitous information retrieval tool in a wide range of domains, but a more automatic framework is still lacking. Although internal metrics are key to successfully retrieving clusters, their effectiveness on real-world datasets is not fully understood, mainly because they make unrealistic assumptions about the underlying datasets. We validated the InfoGuide hypothesis by capturing the traces of information gain using the Kolmogorov-Smirnov statistic and by comparing the clusters retrieved by InfoGuide against those retrieved by other commonly used internal metrics on artificially generated, benchmark, and real-world datasets. Our results suggest that InfoGuide can enable a more automatic clustering analysis and may be more suitable for retrieving clusters in real-world datasets displaying nontrivial statistical properties.
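The abstract does not say how InfoGuide applies the Kolmogorov-Smirnov statistic, so the sketch below illustrates only the statistic itself: the largest vertical gap between the empirical distribution functions of two samples. The function name `ks_statistic` and the sample values are hypothetical, chosen for illustration, and this is not the authors' implementation.

```python
from bisect import bisect_right


def ks_statistic(sample_a, sample_b):
    """Two-sample Kolmogorov-Smirnov statistic.

    Returns the maximum absolute difference between the empirical
    cumulative distribution functions (ECDFs) of the two samples.
    Hypothetical helper for illustration; not part of InfoGuide.
    """
    if not sample_a or not sample_b:
        raise ValueError("both samples must be non-empty")

    a = sorted(sample_a)
    b = sorted(sample_b)

    # Both ECDFs are step functions that only change at observed values,
    # so the largest gap is attained at one of those values.
    largest_gap = 0.0
    for x in set(a) | set(b):
        ecdf_a = bisect_right(a, x) / len(a)  # fraction of a that is <= x
        ecdf_b = bisect_right(b, x) / len(b)  # fraction of b that is <= x
        largest_gap = max(largest_gap, abs(ecdf_a - ecdf_b))
    return largest_gap


if __name__ == "__main__":
    # Illustrative values only: two partially overlapping samples.
    print(ks_statistic([1, 2, 3, 4], [3, 4, 5, 6]))
```

A value of 0 means the two empirical distributions coincide, and a value of 1 means the samples do not overlap at all. For real analyses, a library routine such as SciPy's `scipy.stats.ks_2samp` also reports a p-value.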