Variational Open-Domain Question Answering
Liévin, Valentin, Motzfeldt, Andreas Geert, Jensen, Ida Riis, Winther, Ole
–arXiv.org Artificial Intelligence
Retrieval-augmented models have proven to be effective in natural language processing tasks, yet there remains a lack of research on their optimization using variational inference. We introduce the Variational Open-Domain (VOD) framework for end-to-end training and evaluation of retrieval-augmented models, focusing on open-domain question answering and language modelling. The VOD objective, a self-normalized estimate of the R\'enyi variational bound, approximates the task marginal likelihood and is evaluated under samples drawn from an auxiliary sampling distribution (cached retriever and/or approximate posterior). It remains tractable, even for retriever distributions defined on large corpora. We demonstrate VOD's versatility by training reader-retriever BERT-sized models on multiple-choice medical exam questions. On the MedMCQA dataset, we outperform the domain-tuned Med-PaLM by +5.3% despite using 2.500$\times$ fewer parameters. Our retrieval-augmented BioLinkBERT model scored 62.9% on the MedMCQA and 55.0% on the MedQA-USMLE. Last, we show the effectiveness of our learned retriever component in the context of medical semantic search.
arXiv.org Artificial Intelligence
May-31-2023
- Country:
- Africa > Middle East (0.04)
- North America > United States
- Pennsylvania (0.04)
- New York > New York County
- New York City (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Europe
- Middle East (0.04)
- Italy
- Tuscany > Florence (0.04)
- Calabria > Catanzaro Province
- Catanzaro (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Asia
- Middle East > Jordan (0.04)
- Japan > Honshū
- Chūbu > Toyama Prefecture > Toyama (0.04)
- Genre:
- Research Report (1.00)
- Industry:
- Health & Medicine
- Pharmaceuticals & Biotechnology (1.00)
- Diagnostic Medicine (1.00)
- Therapeutic Area
- Neurology (1.00)
- Infections and Infectious Diseases (1.00)
- Gastroenterology (0.92)
- Rheumatology (0.67)
- Musculoskeletal (0.67)
- Psychiatry/Psychology (0.67)
- Education > Curriculum
- Subject-Specific Education (0.67)
- Health & Medicine
- Technology: