LoGU: Long-form Generation with Uncertainty Expressions

Yang, Ruihan, Zhang, Caiqi, Zhang, Zhisong, Huang, Xinting, Yang, Sen, Collier, Nigel, Yu, Dong, Yang, Deqing

Oct-24-2024–arXiv.org Artificial Intelligence

While Large Language Models (LLMs) demonstrate impressive capabilities, they still struggle with generating factually incorrect content (i.e., hallucinations). A promising approach to mitigate this issue is enabling models to express uncertainty when unsure. Previous research on uncertainty modeling has primarily focused on short-form QA, but realworld applications often require much longer responses. In this work, we introduce the task of Long-form Generation with Uncertainty(LoGU). We identify two key challenges: Uncertainty Suppression, where models hesitate to express uncertainty, and Uncertainty Misalignment, where models convey uncertainty inaccurately. To tackle these challenges, we propose a refinement-based data collection framework and a two-stage training pipeline. Our framework adopts a divide-and-conquer strategy, refining uncertainty based on atomic claims. The collected data are then used in training through supervised fine-tuning (SFT) and direct preference optimization (DPO) to enhance uncertainty expression. Extensive experiments on three long-form instruction following datasets show that our method significantly improves accuracy, reduces hallucinations, and maintains the comprehensiveness of responses.

artificial intelligence, large language model, natural language, (17 more...)

arXiv.org Artificial Intelligence

Oct-24-2024

arXiv.org PDF

Add feedback

Country:
- North America
  - United States
    - Virginia (0.04)
    - California > Los Angeles County
      - Los Angeles (0.04)
  - Mexico
    - Sinaloa (0.04)
    - Mexico City > Mexico City (0.04)
- Europe > United Kingdom
  - England
    - Greater London > London (0.04)
    - Cambridgeshire > Cambridge (0.04)
- Asia
  - Singapore (0.04)
  - China > Hong Kong (0.04)
  - Thailand > Bangkok
    - Bangkok (0.04)
  - South Korea > Seoul
    - Seoul (0.04)

Genre:
- Personal > Obituary (0.46)
- Research Report
  - New Finding (0.67)
  - Promising Solution (0.48)

Industry:
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.93)
- Health & Medicine > Health Care Technology
  - Telehealth (0.46)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found