Understanding the Limits of Lifelong Knowledge Editing in LLMs
Lukas Thede, Karsten Roth, Matthias Bethge, Zeynep Akata, Tom Hartvigsen
arXiv.org Artificial Intelligence
Keeping large language models factually up-to-date is crucial for deployment, yet costly retraining remains a challenge. Knowledge editing offers a promising alternative, but existing methods are tested only on small-scale or synthetic edit benchmarks. In this work, we aim to bridge research on lifelong knowledge editing to real-world edits at practically relevant scale. We first introduce WikiBigEdit, a large-scale benchmark of real-world Wikidata edits, built to extend automatically over time for future-proof benchmarking. In its first instance, it includes over 500K question-answer pairs for knowledge editing, alongside a comprehensive evaluation pipeline. Finally, we use WikiBigEdit to study existing knowledge editing techniques' ability to incorporate large volumes of real-world facts, and contrast their capabilities with generic modification techniques such as retrieval augmentation and continual finetuning, to acquire a complete picture of the practical extent of current lifelong knowledge editing.
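The evaluation protocol the abstract describes — applying a long stream of edits sequentially and then probing whether earlier edits still hold — can be summarized in a short sketch. The following is a minimal, hypothetical illustration: the `Edit`/`MemoryEditor` names and the lookup-table editor are assumptions for exposition, not the WikiBigEdit API or any method from the paper.

```python
# Minimal sketch of a lifelong knowledge-editing evaluation loop.
# All names (Edit, MemoryEditor, apply_edit, answer) are hypothetical
# stand-ins; the lookup-table "editor" mimics retrieval-style editing
# purely for illustration, not a weight-update method.

from dataclasses import dataclass, field


@dataclass
class Edit:
    question: str   # probe for the fact, e.g. "Who directed Film A?"
    answer: str     # updated answer the edited model should return


@dataclass
class MemoryEditor:
    """Toy memory-based editor: stores edits in a lookup table."""
    memory: dict = field(default_factory=dict)

    def apply_edit(self, edit: Edit) -> None:
        # Record the new fact; a real editor would modify the model.
        self.memory[edit.question] = edit.answer

    def answer(self, question: str) -> str | None:
        return self.memory.get(question)


def evaluate_lifelong(editor: MemoryEditor, stream: list[Edit]) -> float:
    """Apply edits sequentially, then check how many edits (including
    early ones) are still answered correctly -- i.e., retention under
    lifelong editing rather than single-edit success."""
    for edit in stream:
        editor.apply_edit(edit)
    correct = sum(editor.answer(e.question) == e.answer for e in stream)
    return correct / len(stream)


if __name__ == "__main__":
    stream = [
        Edit("Who directed Film A?", "Director B"),
        Edit("What is the capital of Country C?", "City D"),
    ]
    print(f"retention: {evaluate_lifelong(MemoryEditor(), stream):.2f}")
```

The trivial editor here retains every edit by construction; the point of a benchmark at WikiBigEdit's scale is that weight-editing methods typically do not, which is what a sequential loop like this surfaces.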
Mar 7, 2025