Parallel Key-Value Cache Fusion for Position Invariant RAG

Oh, Philhoon, Shin, Jinwoo, Thorne, James

Jan-13-2025–arXiv.org Artificial Intelligence

Recent advancements in Large Language Models (LLMs) underscore the necessity of Retrieval Augmented Generation (RAG) to leverage external information. However, LLMs are sensitive to the position of relevant information within contexts and tend to generate incorrect responses when such information is placed in the middle, known as `Lost in the Middle' phenomenon. In this paper, we introduce a framework that generates consistent outputs for decoder-only models, irrespective of the input context order. Experimental results for three open domain question answering tasks demonstrate position invariance, where the model is not sensitive to input context order, and superior robustness to irrelevent passages compared to prevailing approaches for RAG pipelines.

computational linguistic, large language model, machine learning, (20 more...)

arXiv.org Artificial Intelligence

Jan-13-2025

arXiv.org PDF

Add feedback

Country:
- Asia
  - China > Guangxi Province
    - Nanning (0.04)
  - Middle East > Jordan (0.04)
  - Singapore (0.04)
  - Thailand > Bangkok
    - Bangkok (0.04)
- Europe > Italy
  - Calabria > Catanzaro Province > Catanzaro (0.04)
- North America
  - Canada (0.04)
  - Mexico > Mexico City
    - Mexico City (0.04)
  - United States
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - New York > New York County
      - New York City (0.04)
    - Washington > King County
      - Seattle (0.04)
- South America > Argentina (0.05)

Genre:
- Personal > Honors (0.47)
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
  - Natural Language
    - Chatbot (1.00)
    - Large Language Model (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found