CompAct: Compressing Retrieved Documents Actively for Question Answering

Yoon, Chanwoong, Lee, Taewhoo, Hwang, Hyeon, Jeong, Minbyul, Kang, Jaewoo

Jul-15-2024–arXiv.org Artificial Intelligence

Retrieval-augmented generation supports language models to strengthen their factual groundings by providing external contexts. However, language models often face challenges when given extensive information, diminishing their effectiveness in solving questions. Context compression tackles this issue by filtering out irrelevant information, but current methods still struggle in realistic scenarios where crucial information cannot be captured with a single-step approach. To overcome this limitation, we introduce CompAct, a novel framework that employs an active strategy to condense extensive documents without losing key information. Our experiments demonstrate that CompAct brings significant improvements in both performance and compression rate on multi-hop question-answering (QA) benchmarks. CompAct flexibly operates as a cost-efficient plug-in module with various off-the-shelf retrievers or readers, achieving exceptionally high compression rates (47x).

dataset, information, omp, (16 more...)

arXiv.org Artificial Intelligence

Jul-15-2024

arXiv.org PDF

Add feedback

Country:
- Europe > Spain (0.04)
- North America
  - Mexico > Sinaloa (0.04)
  - United States
    - Texas > Travis County
      - Austin (0.04)
    - California > Los Angeles County
      - Downey (0.04)
- Asia
  - Singapore (0.04)
  - China > Guangxi Province
    - Nanning (0.04)

Genre:
- Research Report (0.64)

Industry:
- Law > Criminal Law (0.68)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found