Cross-replication Reliability -- An Empirical Approach to Interpreting Inter-rater Reliability

Wong, Ka, Paritosh, Praveen, Aroyo, Lora

Jun-11-2021–arXiv.org Artificial Intelligence

We present a new approach to interpreting IRR that is empirical and contextualized. It is based upon benchmarking IRR against baseline measures in a replication, one of which is a novel cross-replication reliability (xRR) measure based on Cohen's kappa. We call this approach the xRR framework. We opensource a replication dataset of 4 million human judgements of facial expressions and analyze it with the proposed framework. We argue this framework can be used to measure the quality of crowdsourced datasets.

health & medicine, replication, survey article, (20 more...)

arXiv.org Artificial Intelligence

Jun-11-2021

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Hawaii (0.14)
  - New York (0.14)
  - Texas (0.14)

Genre:
- Overview (0.46)
- Research Report (0.64)

Industry:
- Health & Medicine (0.93)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning (0.95)
  - Communications > Social Media
    - Crowdsourcing (0.36)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found