Retrieval-Enhanced Machine Learning: Synthesis and Opportunities
Kim, To Eun, Salemi, Alireza, Drozdov, Andrew, Diaz, Fernando, Zamani, Hamed
–arXiv.org Artificial Intelligence
In the field of language modeling, models augmented with retrieval components have emerged as a promising solution to address several challenges faced in the natural language processing (NLP) field, including knowledge grounding, interpretability, and scalability. Despite the primary focus on NLP, we posit that the paradigm of retrieval-enhancement can be extended to a broader spectrum of machine learning (ML) such as computer vision, time series prediction, and computational biology. Therefore, this work introduces a formal framework of this paradigm, Retrieval-Enhanced Machine Learning (REML), by synthesizing the literature in various domains in ML with consistent notations which is missing from the current literature. Also, we found that while a number of studies employ retrieval components to augment their models, there is a lack of integration with foundational Information Retrieval (IR) research. We bridge this gap between the seminal IR research and contemporary REML studies by investigating each component that comprises the REML framework. Ultimately, the goal of this work is to equip researchers across various disciplines with a comprehensive, formally structured framework of retrieval-enhanced models, thereby fostering interdisciplinary future research.
arXiv.org Artificial Intelligence
Jul-17-2024
- Country:
- South America > Brazil
- Oceania > Australia
- Victoria > Melbourne (0.04)
- Queensland (0.04)
- North America
- Dominican Republic (0.04)
- United States
- District of Columbia > Washington (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Maryland > Montgomery County
- Gaithersburg (0.04)
- Florida > Hillsborough County
- University (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Missouri > Jackson County
- Kansas City (0.14)
- Georgia > Fulton County
- Atlanta (0.04)
- Washington > King County
- Seattle (0.04)
- Massachusetts
- Hampshire County > Amherst (0.14)
- Middlesex County > Cambridge (0.04)
- Suffolk County > Boston (0.04)
- California
- San Francisco County > San Francisco (0.14)
- Napa County (0.04)
- New York > New York County
- New York City (0.05)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- Canada
- Europe
- Middle East > Malta (0.04)
- Spain
- Galicia > Madrid (0.04)
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Italy
- Tuscany > Florence (0.04)
- Piedmont > Turin Province
- Turin (0.04)
- Calabria > Catanzaro Province
- Catanzaro (0.04)
- Finland > Pirkanmaa
- Tampere (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- United Kingdom
- Scotland > City of Edinburgh
- Edinburgh (0.04)
- England > Cambridgeshire
- Cambridge (0.04)
- Scotland > City of Edinburgh
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Germany > Saarland
- Saarbrücken (0.04)
- Asia
- Taiwan > Taiwan Province
- Taipei (0.04)
- Singapore > Central Region
- Singapore (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Middle East
- Jordan (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Japan
- Kyūshū & Okinawa > Kyūshū
- Miyazaki Prefecture > Miyazaki (0.04)
- Honshū > Kantō
- Tokyo Metropolis Prefecture > Tokyo (0.27)
- Kyūshū & Okinawa > Kyūshū
- China
- Taiwan > Taiwan Province
- Genre:
- Research Report > Promising Solution (0.34)
- Industry:
- Education (1.00)
- Health & Medicine > Pharmaceuticals & Biotechnology (0.65)
- Technology: