Knowledge Base Completion for Long-Tail Entities

Chen, Lihu, Razniewski, Simon, Weikum, Gerhard

Jun-30-2023–arXiv.org Artificial Intelligence

Despite their impressive scale, knowledge bases (KBs) such as Wikidata still contain significant gaps. Language models (LMs) have been proposed as a source for filling these gaps. However, prior work has focused on prominent entities with rich coverage by LMs, neglecting the crucial case of long-tail entities. In this paper, we present a novel method for LM-based KB completion that is specifically geared toward facts about long-tail entities. The method leverages two different LMs in two stages: one for candidate retrieval and one for candidate verification and disambiguation. To evaluate our method and various baselines, we introduce a novel dataset, called MALT, rooted in Wikidata. Our method outperforms all baselines in F1, with major gains especially in recall.
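The two-stage design described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function `complete_fact`, the `threshold` parameter, and the toy `toy_retrieve` and `toy_verify` stand-ins are all hypothetical names introduced here, and in a real system each stage would presumably be backed by a language model rather than a hard-coded lookup.

```python
from typing import Callable, Iterable, List, Tuple


def complete_fact(
    subject: str,
    relation: str,
    retrieve: Callable[[str, str], Iterable[str]],
    verify: Callable[[str, str, str], float],
    threshold: float = 0.5,  # hypothetical acceptance cutoff, not from the paper
) -> List[Tuple[str, float]]:
    """Stage 1 proposes candidate objects; stage 2 scores and filters them."""
    seen = set()
    accepted = []
    for candidate in retrieve(subject, relation):
        if candidate in seen:  # skip duplicate proposals from stage 1
            continue
        seen.add(candidate)
        score = verify(subject, relation, candidate)
        if score >= threshold:
            accepted.append((candidate, score))
    # Highest-scoring candidates first.
    accepted.sort(key=lambda pair: pair[1], reverse=True)
    return accepted


# Toy stand-ins (assumptions for illustration only).
def toy_retrieve(subject: str, relation: str) -> List[str]:
    return ["Paris", "Paris", "Lyon", "Berlin"]


def toy_verify(subject: str, relation: str, candidate: str) -> float:
    return {"Paris": 0.9, "Lyon": 0.6, "Berlin": 0.1}.get(candidate, 0.0)


result = complete_fact("Jane Doe", "place of birth", toy_retrieve, toy_verify)
print(result)
```

Separating the stages this way lets a permissive retriever favor recall while the verifier guards precision, which is one plausible reading of why a two-stage setup would help on long-tail entities.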