Learning to Rank for Multiple Retrieval-Augmented Models through Iterative Utility Maximization

Oct-13-2024–arXiv.org Artificial Intelligence

This paper investigates the design of a unified search engine to serve multiple retrieval-augmented generation (RAG) agents, each with a distinct task, backbone large language model (LLM), and retrieval-augmentation strategy. We introduce an iterative approach where the search engine generates retrieval results for these RAG agents and gathers feedback on the quality of the retrieved documents during an offline phase. This feedback is then used to iteratively optimize the search engine using a novel expectation-maximization algorithm, with the goal of maximizing each agent's utility function. Additionally, we adapt this approach to an online setting, allowing the search engine to refine its behavior based on real-time individual agents feedback to better serve the results for each of them. Experiments on diverse datasets from the Knowledge-Intensive Language Tasks (KILT) benchmark demonstrates that our approach significantly on average outperforms competitive baselines across 18 RAG models. We also demonstrate that our method effectively ``personalizes'' the retrieval process for each RAG agent based on the collected feedback. Finally, we provide a comprehensive ablation study to explore various aspects of our method.

information retrieval, large language model, machine learning, (16 more...)

arXiv.org Artificial Intelligence

Oct-13-2024

arXiv.org PDF

Add feedback

Country:
- South America > Paraguay
  - Asunción > Asunción (0.04)
- North America
  - Dominican Republic (0.04)
  - United States
    - District of Columbia > Washington (0.04)
    - Washington > King County
      - Seattle (0.14)
    - Texas > Travis County
      - Austin (0.04)
    - New York > New York County
      - New York City (0.05)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - Massachusetts
      - Hampshire County > Amherst (0.04)
      - Middlesex County > Cambridge (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
  - Canada
    - Quebec > Montreal (0.04)
    - British Columbia > Metro Vancouver Regional District
      - Vancouver (0.14)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Spain > Galicia
    - Madrid (0.04)
  - Italy > Calabria
    - Catanzaro Province > Catanzaro (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia
  - Thailand > Bangkok
    - Bangkok (0.04)
  - Taiwan > Taiwan Province
    - Taipei (0.04)
  - Middle East > UAE
    - Abu Dhabi Emirate > Abu Dhabi (0.04)
  - Japan > Kyūshū & Okinawa
    - Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)

Genre:
- Overview (1.00)
- Research Report
  - New Finding (1.00)
  - Experimental Study (0.93)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Machine Learning (1.00)
  - Natural Language
    - Large Language Model (1.00)
    - Information Retrieval (1.00)
    - Generation (0.93)