DERA: Enhancing Large Language Model Completions with Dialog-Enabled Resolving Agents

Nair, Varun, Schumacher, Elliot, Tso, Geoffrey, Kannan, Anitha

Mar-29-2023–arXiv.org Artificial Intelligence

Large language models (LLMs) have emerged as valuable tools for many natural language understanding tasks. In safety-critical applications such as healthcare, the utility of these models is governed by their ability to generate outputs that are factually accurate and complete. In this work, we present dialog-enabled resolving agents (DERA). DERA is a paradigm made possible by the increased conversational abilities of LLMs, namely GPT-4. It provides a simple, interpretable forum for models to communicate feedback and iteratively improve output. We frame our dialog as a discussion between two agent types: a Researcher, who processes information and identifies crucial problem components, and a Decider, who has the autonomy to integrate the Researcher's information and make judgments on the final output. We test DERA on three clinically focused tasks. For medical conversation summarization and care plan generation, DERA shows significant improvement over base GPT-4 performance in both human expert preference evaluations and quantitative metrics. In a new finding, we also show that GPT-4's performance (70%) on an open-ended version of the MedQA question-answering (QA) dataset (Jin et al. 2021, USMLE) is well above the passing level (60%), with DERA showing similar performance. We release the open-ended MedQA dataset at https://github.com/curai/curai-research/tree/main/DERA.
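
The Researcher/Decider dialog described in the abstract can be sketched roughly as a loop in which one agent raises a problem and the other revises the output. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `dera_dialog`, the `DONE` stop signal, the turn limit, and the stub agents are all hypothetical. In practice each agent would wrap a call to an LLM such as GPT-4.

```python
from typing import Callable, List, Tuple

# Hypothetical agent signatures (assumptions, not taken from the paper):
#   Researcher(task, current_output) -> feedback string, or STOP when satisfied
#   Decider(task, current_output, feedback) -> revised output
Researcher = Callable[[str, str], str]
Decider = Callable[[str, str, str], str]

STOP = "DONE"  # assumed stop signal


def dera_dialog(
    task: str,
    initial_output: str,
    researcher: Researcher,
    decider: Decider,
    max_turns: int = 3,
) -> Tuple[str, List[Tuple[str, str]]]:
    """Refine `initial_output` through alternating Researcher and Decider turns."""
    output = initial_output
    transcript: List[Tuple[str, str]] = []
    for _ in range(max_turns):
        # The Researcher inspects the current output and raises one issue.
        feedback = researcher(task, output)
        transcript.append(("Researcher", feedback))
        if feedback == STOP:
            break
        # The Decider is free to accept or ignore the feedback when revising.
        output = decider(task, output, feedback)
        transcript.append(("Decider", output))
    return output, transcript


def make_stub_researcher(required_facts: List[str]) -> Researcher:
    """Stand-in for an LLM Researcher: flags the first required fact that is missing."""
    def researcher(task: str, output: str) -> str:
        for fact in required_facts:
            if fact not in output:
                return "Missing: " + fact
        return STOP
    return researcher


def stub_decider(task: str, output: str, feedback: str) -> str:
    """Stand-in for an LLM Decider: appends whatever the Researcher flagged."""
    missing = feedback[len("Missing: "):]
    return output + "; " + missing


if __name__ == "__main__":
    final, log = dera_dialog(
        "Summarize the encounter",
        "Patient reports headache",
        make_stub_researcher(["fever", "cough"]),
        stub_decider,
        max_turns=5,
    )
    for speaker, text in log:
        print(speaker + ": " + text)
    print("Final:", final)
```

Keeping the transcript alongside the final output mirrors the interpretability point in the abstract: every change to the output can be traced back to a specific piece of feedback.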

artificial intelligence, machine learning, natural language, (20 more...)

Country:
- Asia (0.28)
- North America > United States (0.46)

Genre:
- Research Report > New Finding (0.87)

Industry:
- Health & Medicine
  - Consumer Health (1.00)
  - Diagnostic Medicine (1.00)
  - Pharmaceuticals & Biotechnology (1.00)
  - Therapeutic Area
    - Gastroenterology (0.69)
    - Immunology (1.00)
    - Infections and Infectious Diseases (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
  - Natural Language > Large Language Model (1.00)
