Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization
Hamed Zamani, Michael Bendersky
arXiv.org Artificial Intelligence
This paper introduces Stochastic RAG, a novel approach for end-to-end optimization of retrieval-augmented generation (RAG) models that relaxes the simplifying assumptions of marginalization and document independence made in most prior work. Stochastic RAG casts the retrieval process in RAG as a stochastic sampling-without-replacement process. Through this formulation, we employ straight-through Gumbel-top-k, which provides a differentiable approximation for sampling without replacement and enables effective end-to-end optimization for RAG. We conduct extensive experiments on seven diverse datasets covering a wide range of tasks, from open-domain question answering to fact verification, slot-filling for relation extraction, and dialogue systems. By applying this optimization method to a recent and effective RAG model, we advance state-of-the-art results on six out of seven datasets.
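As a rough illustration of the sampling step mentioned in the abstract, the sketch below draws k distinct documents by adding Gumbel noise to retrieval scores and keeping the top k. It is a minimal sketch, not the authors' implementation: the function name `gumbel_top_k`, the treatment of scores as unnormalized log-probabilities, and the use of plain Python are all assumptions made for illustration. The straight-through gradient estimator needs an automatic differentiation framework and is not shown.

```python
import math
import random


def gumbel_top_k(scores, k, rng=random):
    """Sample k distinct indices without replacement via the Gumbel-top-k trick.

    scores: unnormalized log-probabilities, one per candidate document
            (hypothetical input; stands in for retrieval scores).
    rng:    any object with a random() method returning a float in [0, 1).
    """
    if not 0 <= k <= len(scores):
        raise ValueError("k must be between 0 and len(scores)")
    perturbed = []
    for index, score in enumerate(scores):
        # Clamp away from 0 so that log(u) is defined.
        u = max(rng.random(), 1e-12)
        # Standard Gumbel(0, 1) noise: -log(-log(u)).
        noise = -math.log(-math.log(u))
        perturbed.append((score + noise, index))
    # The k largest perturbed scores give an ordered sample without replacement.
    perturbed.sort(reverse=True)
    return [index for _, index in perturbed[:k]]
```

Calling `gumbel_top_k(scores, k)` repeatedly yields different document subsets, with higher-scoring documents chosen more often, which is the stochastic behavior the abstract describes.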
May-5-2024
- Country:
- Asia
- North Korea > Hwanghae-namdo
- Haeju (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- France (0.04)
- Italy > Calabria
- Catanzaro Province > Catanzaro (0.04)
- Spain
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Galicia > Madrid (0.04)
- United Kingdom > England
- West Midlands > Birmingham (0.04)
- North America
- Canada > British Columbia
- United States
- California > Santa Clara County
- Mountain View (0.04)
- District of Columbia > Washington (0.05)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Massachusetts > Hampshire County
- Amherst (0.14)
- New York > New York County
- New York City (0.05)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Washington > King County
- Seattle (0.04)
- Genre:
- Research Report > New Finding (0.46)
- Technology: