Understanding partition comparison indices based on counting object pairs

Warrens, Matthijs J., van der Hoef, Hanneke

Jan-7-2019–arXiv.org Machine Learning

For example, in unsupervised machine learning, to evaluate theperformance of a clustering method, researchers typically assess agreement between a reference standard partition that purports to represent the true cluster structure of the objects (golden standard), and a trial partition produced by the method that is being evaluated (Wallace 1983; Halkidi, Batiskis and Vazirgiannis 2002; Jain 2010). High agreement between the two partitions may indicate good recovery of the true cluster structure. Agreement between partitions can be assessed with so-called external validity indices (Albatineh, Niewiadomska-Bugaj and Mihalko 2006; Brun et al. 2007; Warrens 2008a,2008b; Pfitzner et al. 2009). External validity indices can be roughly categorized into three approaches, namely 1) counting object pairs, 2) information theory (Vinh, Epps and Bailey 2010; Lei et al. 2016), and 3) matching sets (Rezaei and Fränti 2016). Most external validity indices are of the pair-counting approach, which is based on counting pairs of objects placed in identical and different clusters.

agreement, partition, rand index, (14 more...)

arXiv.org Machine Learning

Jan-7-2019

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York (0.04)
  - Kansas (0.04)
  - Massachusetts > Suffolk County
    - Boston (0.04)
  - California
    - San Francisco County > San Francisco (0.04)
    - Orange County > Irvine (0.04)
- Europe
  - Netherlands > North Holland
    - Amsterdam (0.04)
  - Denmark > Capital Region
    - Copenhagen (0.04)
- Asia
  - Singapore (0.04)
  - Japan (0.04)
  - Indonesia (0.04)

Genre:
- Research Report (0.40)

Industry:
- Health & Medicine (0.46)

Technology:
- Information Technology
  - Information Management (1.00)
  - Artificial Intelligence > Machine Learning
    - Statistical Learning > Clustering (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found