A Closer Look at Claim Decomposition

Wanner, Miriam, Ebner, Seth, Jiang, Zhengping, Dredze, Mark, Van Durme, Benjamin

Mar-18-2024–arXiv.org Artificial Intelligence

As generated text becomes more commonplace, it is increasingly important to evaluate how well-supported such text is by external knowledge sources. Many approaches for evaluating textual support rely on some method for decomposing text into its individual subclaims which are scored against a trusted reference. We investigate how various methods of claim decomposition -- especially LLM-based methods -- affect the result of an evaluation approach such as the recently proposed FActScore, finding that it is sensitive to the decomposition method used. This sensitivity arises because such metrics attribute overall textual support to the model that generated the text even though error can also come from the metric's decomposition step. To measure decomposition quality, we introduce an adaptation of FActScore, which we call DecompScore. We then propose an LLM-based approach to generating decompositions inspired by Bertrand Russell's theory of logical atomism and neo-Davidsonian semantics and demonstrate its improved decomposition quality over previous methods.

alfred hitchcock, decomposition, subclaim, (13 more...)

arXiv.org Artificial Intelligence

Mar-18-2024

arXiv.org PDF

Add feedback

Country:
- North America
  - Dominican Republic (0.04)
  - United States
    - Minnesota (0.04)
    - Alabama (0.04)
    - New York (0.04)
    - Washington > King County
      - Seattle (0.04)
    - Texas > Travis County
      - Austin (0.04)
    - California > San Diego County
      - San Diego (0.04)
  - Canada > Ontario
    - Toronto (0.04)
- Europe
  - Czechia > Prague (0.04)
  - Spain > Valencian Community
    - Valencia Province > Valencia (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
- Asia
  - Singapore (0.04)
  - South Korea > Seoul
    - Seoul (0.04)
  - Middle East > UAE
    - Abu Dhabi Emirate > Abu Dhabi (0.04)
  - Japan > Honshū
    - Tōhoku > Iwate Prefecture > Morioka (0.04)

Genre:
- Research Report (0.50)

Industry:
- Media
  - Film (1.00)
  - Music (0.68)
- Leisure & Entertainment > Sports
  - Football (0.67)
- Government > Regional Government
  - North America Government > United States Government (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Grammars & Parsing (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.69)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found