Consistency is Key: Disentangling Label Variation in Natural Language Processing with Intra-Annotator Agreement
Abercrombie, Gavin, Rieser, Verena, Hovy, Dirk
–arXiv.org Artificial Intelligence
We commonly use agreement measures to assess the utility of judgements made by human annotators in Natural Language Processing (NLP) tasks. While inter-annotator agreement is frequently used as an indication of label reliability by measuring consistency between annotators, we argue for the additional use of intra-annotator agreement to measure label stability over time. However, in a systematic review, we find that the latter is rarely reported in this field. Calculating these measures can act as important quality control and provide insights into why annotators disagree. We propose exploratory annotation experiments to investigate the relationships between these measures and perceptions of subjectivity and ambiguity in text items.
arXiv.org Artificial Intelligence
Jan-25-2023
- Country:
- Oceania > Australia (0.04)
- North America
- United States
- Ohio (0.04)
- Maryland > Baltimore (0.04)
- Washington > King County
- Seattle (0.14)
- Oregon > Multnomah County
- Portland (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.04)
- Massachusetts > Hampshire County
- Amherst (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Colorado > Denver County
- Denver (0.04)
- Canada
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- United States
- Europe
- Spain (0.04)
- Czechia > Prague (0.04)
- Germany > Berlin (0.04)
- Belgium (0.04)
- Ukraine (0.04)
- Iceland > Capital Region
- Reykjavik (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Bulgaria > Sofia City Province
- Sofia (0.04)
- Greece > Attica
- Athens (0.04)
- Portugal > Lisbon
- Lisbon (0.14)
- Italy
- Middle East
- Netherlands > South Holland
- Dordrecht (0.04)
- Norway > Western Norway
- Sweden
- Östergötland County > Linköping (0.04)
- Uppsala County > Uppsala (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- United Kingdom > Scotland
- City of Edinburgh > Edinburgh (0.04)
- Asia
- Singapore (0.04)
- Middle East
- Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Antalya Province > Antalya (0.04)
- Qatar > Ad-Dawhah
- Doha (0.04)
- Oman > Muscat Governorate
- Muscat (0.04)
- Republic of Türkiye
- Japan > Kyūshū & Okinawa
- Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)
- China > Fujian Province
- Xiamen (0.04)
- Africa > Middle East
- Morocco (0.04)
- Genre:
- Research Report (1.00)
- Industry:
- Health & Medicine (0.93)
- Technology: