Autoregressive Language Models for Knowledge Base Population: A case study in the space mission domain
García-Silva, Andrés, Gómez-Pérez, José Manuel
arXiv.org Artificial Intelligence
Knowledge base population (KBP) plays a crucial role in populating knowledge bases and keeping them up to date in organizations by leveraging domain corpora. Motivated by the increasingly large context windows supported by large language models, we propose to fine-tune an autoregressive language model for end-to-end KBP. Our case study involves the population of a space mission knowledge graph. To fine-tune the model, we generate a dataset for end-to-end KBP by tapping into existing domain resources. Our case study shows that fine-tuned language models of limited size can achieve accuracy competitive with, and even higher than, that of larger models in the KBP task. Smaller models specialized for KBP offer affordable deployment and lower-cost inference. Moreover, KBP specialist models do not require the ontology to be included in the prompt, leaving more space in the context for additional input text or output serialization.
Mar-24-2025
- Country:
- Asia
- Japan > Kyūshū & Okinawa
- Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)
- Middle East > Jordan (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Europe
- North America
- Dominican Republic (0.04)
- United States
- California > Santa Clara County
- Palo Alto (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- New York > New York County
- New York City (0.04)
- Oregon (0.04)
- Genre:
- Research Report (0.40)
- Technology: