Thrust: Adaptively Propels Large Language Models with External Knowledge

Jan-20-2025, 00:26:48 GMT–Neural Information Processing Systems

Although large-scale pre-trained language models (PTLMs) are shown to encode rich knowledge in their model parameters, the inherent knowledge in PTLMs can be opaque or static, making external knowledge necessary. However, the existing information retrieval techniques could be costly and may even introduce noisy and sometimes misleading knowledge. To address these challenges, we propose the instance-level adaptive propulsion of external knowledge (IAPEK), where we only conduct the retrieval when necessary. To achieve this goal, we propose to model whether a PTLM contains enough knowledge to solve an instance with a novel metric, Thrust, which leverages the representation distribution of a small amount of seen instances. Extensive experiments demonstrate that Thrust is a good measurement of models' instance-level knowledgeability.

external knowledge, knowledge, thrust, (3 more...)

Neural Information Processing Systems

Jan-20-2025, 00:26:48 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Natural Language
  - Information Retrieval (0.63)
  - Large Language Model (0.40)