GENUINE: Graph Enhanced Multi-level Uncertainty Estimation for Large Language Models
Wang, Tuo, Kulkarni, Adithya, Cody, Tyler, Beling, Peter A., Yan, Yujun, Zhou, Dawei
arXiv.org Artificial Intelligence
Uncertainty estimation is essential for enhancing the reliability of Large Language Models (LLMs), particularly in high-stakes applications. Existing methods often overlook semantic dependencies, relying on token-level probability measures that fail to capture structural relationships within the generated text. We propose GENUINE: Graph ENhanced mUlti-level uncertaINty Estimation for Large Language Models, a structure-aware framework that leverages dependency parse trees and hierarchical graph pooling to refine uncertainty quantification. By incorporating supervised learning, GENUINE effectively models semantic and structural relationships, improving confidence assessments. Extensive experiments across NLP tasks show that GENUINE achieves up to 29% higher AUROC than semantic entropy-based approaches and reduces calibration errors by over 15%, demonstrating the effectiveness of graph-based uncertainty modeling. The code is available at https://github.com/ODYSSEYWT/GUQ.
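The contrast the abstract draws between flat token-level scores and structure-aware aggregation can be illustrated with a small sketch. This is not the GENUINE implementation: the paper describes learned, supervised graph pooling, whereas the code below uses a fixed blending weight. The function name `pool_tree`, the parameter `alpha`, the hand-written dependency heads, and the per-token uncertainty values are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def pool_tree(heads, token_unc, alpha=0.5):
    """Aggregate per-token uncertainties bottom-up over a dependency tree.

    heads[i] is the index of token i's head, or -1 for the root.
    Each node's pooled score blends its own uncertainty with the mean of
    its children's pooled scores. alpha is a fixed, hypothetical weight;
    a learned model would replace this rule.
    """
    if len(heads) != len(token_unc):
        raise ValueError("heads and token_unc must have the same length")

    children = defaultdict(list)
    root = None
    for i, h in enumerate(heads):
        if h == -1:
            root = i
        else:
            children[h].append(i)
    if root is None:
        raise ValueError("no root token (head == -1) found")

    def pooled(i):
        kids = children[i]
        if not kids:
            # Leaf token: its score is just its own uncertainty.
            return token_unc[i]
        child_mean = sum(pooled(k) for k in kids) / len(kids)
        return alpha * token_unc[i] + (1 - alpha) * child_mean

    return pooled(root)


# Hypothetical example: tokens "Paris", "is", "the", "capital" with a
# hand-written parse (not produced by a real parser).
heads = [1, -1, 3, 1]
token_unc = [0.9, 0.1, 0.1, 0.3]

flat_score = sum(token_unc) / len(token_unc)   # ignores structure
tree_score = pool_tree(heads, token_unc)       # follows the parse tree
```

With these made-up values the flat mean and the tree-pooled score differ, because the tree version weights each token by its position in the parse rather than treating all tokens equally. Which of the two tracks correctness better is an empirical question that the sketch does not answer.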
Sep-10-2025
- Country:
- Arctic Ocean > Barents Sea
- White Sea (0.04)
- Asia
- Europe
- Austria > Vienna (0.14)
- Holy See > Vatican City (0.04)
- Middle East > Malta
- Eastern Region > Northern Harbour District > St. Julian's (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- North America
- Canada (0.04)
- United States
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Virginia (0.04)
- South America > Colombia
- Bogotá D.C. > Bogotá (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Health & Medicine (0.46)
- Technology: