Investigating Reasons for Disagreement in Natural Language Inference

Jiang, Nan-Jiang, de Marneffe, Marie-Catherine

Sep-7-2022–arXiv.org Artificial Intelligence

We investigate how disagreement in natural language inference (NLI) annotation arises. We developed a taxonomy of disagreement sources with 10 categories spanning 3 high-level classes. We found that some disagreements are due to uncertainty in the sentence meaning, others to annotator biases and task artifacts, leading to different interpretations of the label distribution. We explore two modeling approaches for detecting items with potential disagreement: a 4-way classification with a "Complicated" label in addition to the three standard NLI labels, and a multilabel classification approach. We found that the multilabel classification is more expressive and gives better recall of the possible interpretations in the data.

annotation, computational linguistic, disagreement, (15 more...)

arXiv.org Artificial Intelligence

Sep-7-2022

arXiv.org PDF

Add feedback

Country:
- North America
  - Dominican Republic (0.04)
  - United States
    - Michigan (0.04)
    - Maryland > Baltimore (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.04)
    - Ohio > Franklin County
      - Columbus (0.04)
    - Arizona > Maricopa County
      - Phoenix (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - Oregon > Multnomah County
      - Portland (0.04)
    - North Carolina > Rowan County
      - Salisbury (0.04)
    - Massachusetts > Hampshire County
      - Amherst (0.04)
    - New York > New York County
      - New York City (0.04)
- Europe
  - Ireland (0.04)
  - Czechia > Prague (0.04)
  - Netherlands > North Holland
    - Amsterdam (0.04)
  - Iceland > Capital Region
    - Reykjavik (0.04)
  - Italy > Tuscany
    - Florence (0.04)
  - United Kingdom > England
    - Oxfordshire > Oxford (0.04)
    - Cambridgeshire > Cambridge (0.04)
  - Portugal > Lisbon
    - Lisbon (0.04)
  - Sweden
    - Uppsala County > Uppsala (0.04)
    - Vaestra Goetaland > Gothenburg (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia
  - Singapore (0.04)
  - India (0.04)
  - China > Hong Kong (0.04)
  - Middle East > Israel
    - Tel Aviv District > Tel Aviv (0.04)
    - Southern District > Eilat (0.04)
    - Haifa District > Haifa (0.04)
- Africa > Middle East
  - Egypt > Cairo Governorate > Cairo (0.04)

Genre:
- Research Report (0.82)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found