Causality for Natural Language Processing

Jin, Zhijing

arXiv.org Artificial Intelligence 

In the field of natural language processing (NLP), the capability to infer and reason about causality is increasingly recognized as a critical component of intelligent systems. Despite the recent advancement of large language models (LLMs) (Radford et al., 2019; Devlin et al., 2019; Brown et al., 2020; Zhang et al., 2022; OpenAI, 2023; Ignat et al., 2024, inter alia), a key question remains: Can these models understand and reason about causality? This is a skill AI agents must demonstrate before we can trust them to be integrated into decision-making systems. Moreover, even when LLMs succeed at reasoning to some extent, how their decisions are made remains opaque, creating a strong need for interpretability (Luo and Specia, 2024; Räuker et al., 2023; Zou et al., 2023). To bridge this gap, this thesis explores various facets of causal reasoning in LLMs. We present a series of studies that collectively advance our knowledge of how well these models perform causal reasoning (Part I), how their decisions are made (Part II), how causality among learning variables influences NLP tasks (Part III), and how causality and NLP can together analyze social problems (Part IV). Below we provide an overview of the four parts and their corresponding chapters.