ScopeQA: A Framework for Generating Out-of-Scope Questions for RAG
Zhiyuan Peng, Jinming Nian, Alexandre Evfimievski, Yi Fang
Conversational AI agents use Retrieval Augmented Generation (RAG) to provide verifiable, document-grounded responses to user inquiries. However, many natural questions do not have good answers: about 25% contain false assumptions [Yu2023:CREPE], and over 50% are ambiguous [DBLP:conf/emnlp/MinMHZ20]. RAG agents need high-quality data to improve their responses to confusing questions. This paper presents a novel guided hallucination-based method to efficiently generate a diverse set of borderline out-of-scope confusing questions for a given document corpus. We conduct an empirical comparative evaluation of several large language models as RAG agents to measure the accuracy of confusion detection and appropriate response generation. We contribute a benchmark dataset to the public domain.
arXiv.org Artificial Intelligence
Dec-19-2024
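The listing does not include implementation details, but the guided-hallucination idea can be sketched: prompt an LLM with a corpus document and steer it to invent a question that is topically adjacent yet unanswerable from that document. The sketch below is a hypothetical illustration, not the authors' method; the prompt wording, model name, and the `generate_borderline_question` helper are all assumptions.

```python
# Hypothetical sketch of guided-hallucination question generation.
# Not the paper's implementation; prompt and model choice are assumptions.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

GUIDE_PROMPT = (
    "Read the document below. Write one question that a user might "
    "plausibly ask on this topic but that CANNOT be answered from the "
    "document, e.g. because it rests on a false assumption or asks about "
    "details the document never covers. Return only the question.\n\n"
    "Document:\n{doc}"
)

def generate_borderline_question(doc: str, model: str = "gpt-4o-mini") -> str:
    """Steer the model to 'hallucinate' a borderline out-of-scope question."""
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": GUIDE_PROMPT.format(doc=doc)}],
        temperature=1.0,  # higher temperature encourages question diversity
    )
    return response.choices[0].message.content.strip()
```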