Efficient Uncertainty Estimation for LLM-based Entity Linking in Tabular Data
Bono, Carlo, Belotti, Federico, Palmonari, Matteo
Linking textual values in tabular data to their corresponding entities in a Knowledge Base is a core task across a variety of data integration and enrichment applications. Although Large Language Models (LLMs) have shown State-of-The-Art performance in Entity Linking (EL) tasks, their deployment in real-world scenarios requires not only accurate predictions but also reliable uncertainty estimates, which require resource-demanding multi-shot inference, posing serious limits to their actual applicability. As a more efficient alternative, we investigate a self-supervised approach for estimating uncertainty from single-shot LLM outputs using token-level features, reducing the need for multiple generations. Evaluation is performed on an EL task on tabular data across multiple LLMs, showing that the resulting uncertainty estimates are highly effective in detecting low-accuracy outputs. This is achieved at a fraction of the computational cost, ultimately supporting a cost-effective integration of uncertainty measures into LLM-based EL workflows. The method offers a practical way to incorporate uncertainty estimation into EL workflows with limited computational overhead.
Oct-3-2025
- Country:
- Africa
- Eritrea (0.04)
- Ethiopia (0.05)
- Kenya (0.04)
- Middle East > Somalia (0.04)
- South Africa (0.04)
- Sudan (0.04)
- Asia > Japan (0.04)
- Europe
- Ireland (0.14)
- Italy (0.04)
- Spain
- Balearic Islands (0.04)
- Galicia > Madrid (0.06)
- Switzerland (0.04)
- United Kingdom > England
- Bedfordshire (0.04)
- North America
- Mexico > Mexico City
- Mexico City (0.04)
- United States
- Illinois > Cook County
- Chicago (0.04)
- Iowa (0.04)
- New York
- Dutchess County (0.04)
- New York County > New York City (0.04)
- North Carolina (0.04)
- South Dakota (0.05)
- Illinois > Cook County
- Mexico > Mexico City
- Africa
- Genre:
- Research Report > New Finding (0.67)
- Industry:
- Government > Regional Government (0.46)
- Leisure & Entertainment (0.93)
- Technology: