CLUE: Concept-Level Uncertainty Estimation for Large Language Models

Yu-Hsiang Wang, Andrew Bai, Che-Ping Tsai, Cho-Jui Hsieh

Sep-4-2024, arXiv.org Artificial Intelligence

Large Language Models (LLMs) have demonstrated remarkable proficiency in various natural language generation (NLG) tasks. Previous studies suggest that the generation process of LLMs involves uncertainty. However, existing approaches to uncertainty estimation focus mainly on sequence-level uncertainty and therefore cannot assess the uncertainty of each individual piece of information within a sequence. In response, we propose a novel framework for Concept-Level Uncertainty Estimation (CLUE) for LLMs. We leverage LLMs to convert output sequences into concept-level representations, breaking sequences down into individual concepts and measuring the uncertainty of each concept separately. We conduct experiments to demonstrate that CLUE provides more interpretable uncertainty estimates than sentence-level uncertainty and could be a useful tool for tasks such as hallucination detection and story generation.
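
The idea of scoring each concept separately, rather than the whole sequence, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the abstract: the function names `concept_uncertainty` and `toy_score`, the choice of averaging the negative log of a per-sample support score, and the substring-based scorer that stands in for whatever model would judge whether a sampled output supports a concept. The concept list is passed in directly; the LLM-based step that extracts concepts from outputs is not shown.

```python
import math
from typing import Callable, Dict, List


def concept_uncertainty(
    samples: List[str],
    concepts: List[str],
    score: Callable[[str, str], float],
    eps: float = 1e-6,
) -> Dict[str, float]:
    """Return one uncertainty value per concept.

    Assumed aggregation (hypothetical, for illustration): the mean of
    -log(score) over all sampled outputs, where score(sample, concept)
    is in (0, 1] and measures how strongly the sample supports the concept.
    """
    if not samples:
        raise ValueError("at least one sampled output is required")
    result: Dict[str, float] = {}
    for concept in concepts:
        total = 0.0
        for sample in samples:
            # Clamp the score into [eps, 1] so the logarithm stays finite.
            s = min(max(score(sample, concept), eps), 1.0)
            total += -math.log(s)
        result[concept] = total / len(samples)
    return result


def toy_score(sample: str, concept: str) -> float:
    """Stand-in scorer: high support if the concept text appears verbatim."""
    return 0.99 if concept.lower() in sample.lower() else 0.01


# Made-up sampled outputs for a single prompt (illustrative data only).
samples = [
    "The knight found the map in the old library.",
    "The knight found the map under the bridge.",
    "The knight found the map in the old library at dawn.",
]
# Concepts would normally come from an LLM-based extraction step.
concepts = ["knight found the map", "old library", "under the bridge"]

uncertainty = concept_uncertainty(samples, concepts, toy_score)
for concept, value in uncertainty.items():
    print(f"{concept}: {value:.3f}")
```

In this toy run, the concept shared by all samples receives the lowest uncertainty and the concept that appears in only one sample receives the highest, which a single sequence-level score would not separate.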
