A fine-grained comparison of pragmatic language understanding in humans and language models

Hu, Jennifer, Floyd, Sammy, Jouravlev, Olessia, Fedorenko, Evelina, Gibson, Edward

May-23-2023–arXiv.org Artificial Intelligence

Pragmatics and non-literal language understanding are essential to human communication, and present a long-standing challenge for artificial language models. We perform a fine-grained comparison of language models and humans on seven pragmatic phenomena, using zero-shot prompting on an expert-curated set of English materials. We ask whether models (1) select pragmatic interpretations of speaker utterances, (2) make similar error patterns as humans, and (3) use similar linguistic cues as humans to solve the tasks. We find that the largest models achieve high accuracy and match human error patterns: within incorrect responses, models favor literal interpretations over heuristic-based distractors. We also find preliminary evidence that models and humans are sensitive to similar linguistic cues. Our results suggest that pragmatic behaviors can emerge in models without explicitly constructed representations of mental states. However, models tend to struggle with phenomena relying on social expectation violations.

artificial intelligence, large language model, natural language, (18 more...)

arXiv.org Artificial Intelligence

May-23-2023

arXiv.org PDF

Add feedback

Country:
- North America
  - United States
    - Washington > King County
      - Seattle (0.04)
    - New York > New York County
      - New York City (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - Massachusetts > Middlesex County
      - Cambridge (0.14)
    - Illinois > Cook County
      - Chicago (0.04)
    - California > Los Angeles County
      - Los Angeles (0.14)
  - Canada > Ontario
    - National Capital Region > Ottawa (0.04)
- Europe
  - Italy (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
    - Oxfordshire > Oxford (0.04)
  - Netherlands > South Holland
    - Leiden (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
  - Germany > Bavaria
    - Regensburg (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.14)
- Asia
  - China > Hong Kong (0.04)
  - Middle East > UAE
    - Abu Dhabi Emirate > Abu Dhabi (0.04)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Health & Medicine > Therapeutic Area > Neurology (0.93)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.91)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found