Underspecification in Language Modeling Tasks: A Causality-Informed Study of Gendered Pronoun Resolution

Jul-17-2023–arXiv.org Artificial Intelligence

Modern language modeling tasks are often underspecified: for a given token prediction, many words may satisfy the user's intent of producing natural language at inference time, however only one word would minimize the task's loss function at training time. We provide a simple yet plausible causal mechanism describing the role underspecification plays in the generation of spurious correlations. Despite its simplicity, our causal model directly informs the development of two lightweight black-box evaluation methods, that we apply to gendered pronoun resolution tasks on a wide range of LLMs to 1) aid in the detection of inference-time task underspecification by exploiting 2) previously unreported gender vs. time and gender vs. location spurious correlations on LLMs with a range of A) sizes: from BERT-base to GPT 3.5, B) pre-training objectives: from masked & autoregressive language modeling to a mixture of these objectives, and C) training stages: from pre-training only to reinforcement learning from human feedback (RLHF). Code and open-source demos available at https: //github.com/2dot71mily/sib_paper.

correlation, large language model, machine learning, (21 more...)

arXiv.org Artificial Intelligence

Jul-17-2023

arXiv.org PDF

Add feedback

Country:
- Oceania > New Zealand (0.04)
- North America > United States
  - Louisiana > Orleans Parish
    - New Orleans (0.04)
  - California > Santa Clara County
    - Palo Alto (0.04)
- Europe
  - Spain > Canary Islands (0.04)
  - Iceland (0.04)
  - Switzerland (0.04)
  - Finland (0.04)
  - Norway (0.04)
  - Lithuania (0.04)
  - Sweden (0.04)
  - Ireland (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.14)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia
  - Pakistan (0.04)
  - Afghanistan (0.04)
  - Middle East
    - Yemen (0.04)
    - Syria (0.04)
    - Iraq (0.04)
    - Iran (0.04)
    - UAE > Abu Dhabi Emirate
      - Abu Dhabi (0.04)
- Africa
  - Mali (0.04)
  - Rwanda (0.04)
  - Namibia (0.04)
  - Democratic Republic of the Congo (0.04)

Genre:
- Research Report (0.51)

Industry:
- Health & Medicine (0.47)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found