Evaluation of AI Chatbots for Patient-Specific EHR Questions
–arXiv.org Artificial Intelligence
This paper investigates the use of artificial intelligence chatbots for patient-specific question answering (QA) from clinical notes using several large language model (LLM) based systems: ChatGPT (versions 3.5 and 4), Google Bard, and Claude. We evaluate the accuracy, relevance, comprehensiveness, and coherence of the answers generated by each model using a 5-point Likert scale on a set of patient-specific questions.
arXiv.org Artificial Intelligence
Jun-4-2023
- Country:
- North America
- United States
- Texas (0.04)
- Michigan (0.04)
- Massachusetts (0.04)
- Canada > Ontario
- Toronto (0.04)
- United States
- Europe > Italy
- Calabria > Catanzaro Province > Catanzaro (0.04)
- Asia
- Pakistan (0.04)
- Middle East > Jordan (0.04)
- Japan > Honshū
- Chūbu > Toyama Prefecture > Toyama (0.04)
- North America
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.69)
- Research Report
- Industry:
- Technology: