Emulating Human Cognitive Processes for Expert-Level Medical Question-Answering with Large Language Models

Verma, Khushboo, Moore, Marina, Wottrich, Stephanie, López, Karla Robles, Aggarwal, Nishant, Bhatt, Zeel, Singh, Aagamjit, Unroe, Bradford, Basheer, Salah, Sachdeva, Nitish, Arora, Prinka, Kaur, Harmanjeet, Kaur, Tanupreet, Hood, Tevon, Marquez, Anahi, Varshney, Tushar, Deng, Nanfu, Ramani, Azaan, Ishwara, Pawanraj, Saeed, Maimoona, Peña, Tatiana López Velarde, Barksdale, Bryan, Guha, Sushovan, Kumar, Satwant

arXiv.org Artificial Intelligence 

In response to the pressing need for advanced clinical problem-solving tools in healthcare, we introduce BooksMed, a novel framework based on a Large Language Model (LLM). BooksMed uniquely emulates human cognitive processes to deliver evidence-based and reliable responses, utilizing the GRADE (Grading of Recommendations, Assessment, Development, and Evaluations) framework to quantify evidence strength. Appropriately assessing clinical decision-making requires an evaluation metric that is clinically aligned and validated. As a solution, we present ExpertMedQA, a multispecialty clinical benchmark comprising open-ended, expert-level clinical questions validated by a diverse group of medical professionals. By demanding an in-depth understanding and critical appraisal of up-to-date clinical literature, ExpertMedQA rigorously evaluates LLM performance. BooksMed outperforms existing state-of-the-art models Med-PaLM 2, Almanac, and ChatGPT in a variety of medical scenarios. Therefore, a framework that mimics human cognitive stages could be a useful tool for providing reliable and evidence-based responses to clinical inquiries.
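The abstract mentions using the GRADE framework to quantify evidence strength. The following is a minimal, illustrative sketch of GRADE-style evidence grading, not the paper's implementation: the study-design labels, the single-step downgrade rule, and the function names are simplified assumptions for demonstration. GRADE itself defines four certainty levels (high, moderate, low, very low), starts randomized trials at "high" and observational studies at "low", and lowers certainty per concern such as risk of bias or imprecision.

```python
# Illustrative sketch of GRADE-style evidence grading (hypothetical helper,
# not BooksMed's actual code). Certainty levels and baselines follow the
# published GRADE approach; the downgrade mechanics here are simplified.

GRADE_LEVELS = ["very low", "low", "moderate", "high"]

# GRADE baselines: randomized trials start at "high" certainty,
# observational studies at "low".
BASELINE = {
    "randomized_controlled_trial": "high",
    "observational_study": "low",
    "case_report": "very low",
}

def grade_evidence(study_design: str, downgrades: int = 0) -> str:
    """Return a GRADE certainty level, lowering the baseline one step per
    downgrade factor (e.g., risk of bias, inconsistency, imprecision)."""
    start = GRADE_LEVELS.index(BASELINE[study_design])
    return GRADE_LEVELS[max(0, start - downgrades)]
```

For example, an RCT with one serious risk-of-bias concern would be graded `grade_evidence("randomized_controlled_trial", downgrades=1)`, i.e. "moderate" certainty.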
