Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Jan-25-2025, 09:42:42 GMT - Neural Information Processing Systems

Large pre-trained language models have been shown to store factual knowledge in their parameters and to achieve state-of-the-art results when fine-tuned on downstream NLP tasks. However, their ability to access and precisely manipulate knowledge is still limited, and hence on knowledge-intensive tasks their performance lags behind task-specific architectures. Additionally, providing provenance for their decisions and updating their world knowledge remain open research problems. Pre-trained models with a differentiable access mechanism to explicit non-parametric memory can overcome this issue, but have so far been investigated only for extractive downstream tasks. We explore a general-purpose fine-tuning recipe for retrieval-augmented generation (RAG): models that combine pre-trained parametric and non-parametric memory for language generation.
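
The abstract above does not spell out how the parametric and non-parametric memories are combined. As a minimal sketch of one plausible reading, the Python below assumes that a retriever scores documents by inner product with a query vector, that a softmax over the top-k scores gives a distribution over documents, and that the generator's answer probabilities are averaged under that distribution. Every name here (retrieve, marginal_answer_probs, toy_generator, DOC_VECS, TOY_OUTPUTS) and all of the numbers are hypothetical stand-ins for illustration, not the authors' implementation.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def retrieve(query_vec, doc_vecs, k):
    """Non-parametric memory (assumed form): top-k documents by inner product.

    Returns the document indices and a softmax distribution over them.
    """
    scores = [dot(query_vec, d) for d in doc_vecs]
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    return top, softmax([scores[i] for i in top])

def marginal_answer_probs(query_vec, doc_vecs, generator, k=2):
    """Weight the generator's output for each retrieved document by that
    document's retrieval probability, then sum over documents."""
    top, doc_probs = retrieve(query_vec, doc_vecs, k)
    combined = {}
    for idx, p_doc in zip(top, doc_probs):
        for answer, p_ans in generator(idx).items():
            combined[answer] = combined.get(answer, 0.0) + p_doc * p_ans
    return combined

# Toy document embeddings (hypothetical).
DOC_VECS = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]

# Stand-in for the parametric generator: a fixed answer distribution per
# document. A real generator would also condition on the input text.
TOY_OUTPUTS = {
    0: {"Paris": 0.9, "Lyon": 0.1},
    1: {"Paris": 0.2, "Lyon": 0.8},
    2: {"Paris": 0.6, "Lyon": 0.4},
}

def toy_generator(doc_idx):
    return TOY_OUTPUTS[doc_idx]

top_docs, top_probs = retrieve([1.0, 0.2], DOC_VECS, k=2)
result = marginal_answer_probs([1.0, 0.2], DOC_VECS, toy_generator, k=2)
```

Because the retrieval weights enter the final probabilities through a softmax and a weighted sum, the whole computation is differentiable with respect to the retrieval scores, which is one way to read the "differentiable access mechanism" the abstract mentions. The retrieved indices in top_docs also give a simple form of provenance for the answer.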

knowledge-intensive NLP task, non-parametric memory, retrieval-augmented generation

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (0.64)
    - Chatbot (0.64)
  - Machine Learning > Neural Networks
    - Deep Learning (0.64)