Enhancing Post-Hoc Attributions in Long Document Comprehension via Coarse Grained Answer Decomposition

Ramu, Pritika, Goswami, Koustava, Saxena, Apoorv, Srinivasan, Balaji Vasan

Nov-23-2024–arXiv.org Artificial Intelligence

Accurately attributing answer text to its source document is crucial for developing a reliable question-answering system. However, attribution for long documents remains largely unexplored. Post-hoc attribution systems are designed to map answer text back to the source document, yet the granularity of this mapping has not been addressed. Furthermore, a critical question arises: What exactly should be attributed? This involves identifying the specific information units within an answer that require grounding. In this paper, we propose and investigate a novel approach to the factual decomposition of generated answers for attribution, employing template-based in-context learning. To accomplish this, we utilize the question and integrate negative sampling during few-shot in-context learning for decomposition. This approach enhances the semantic understanding of both abstractive and extractive answers. We examine the impact of answer decomposition by providing a thorough examination of various attribution approaches, ranging from retrieval-based techniques to LLM-based attributors.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

Nov-23-2024

arXiv.org PDF

Add feedback

Country:
- North America
  - United States (0.28)
  - Mexico > Mexico City
    - Mexico City (0.04)
  - Canada > Ontario
    - Toronto (0.04)
- Europe > Italy
  - Marche > Ancona Province > Ancona (0.04)
- Asia
  - Singapore (0.04)
  - India (0.04)
  - Middle East
    - Jordan (0.04)
    - UAE > Abu Dhabi Emirate
      - Abu Dhabi (0.04)

Genre:
- Research Report (0.84)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (0.95)
  - Machine Learning > Neural Networks
    - Deep Learning (0.70)