AI chatbots fail to diagnose patients by talking with them

Jan-2-2025, 10:00:04 GMT–New Scientist

Advanced artificial intelligence models score well on professional medical exams but still flunk one of the most crucial physician tasks: talking with patients to gather relevant medical information and deliver an accurate diagnosis. "While large language models show impressive results on multiple-choice tests, their accuracy drops significantly in dynamic conversations," says Pranav Rajpurkar at Harvard University. That became evident when researchers developed a method for evaluating a clinical AI model's reasoning capabilities based on simulated doctor-patient conversations. The "patients" were based on 2000 medical cases primarily drawn from professional US medical board exams. "Simulating patient interactions enables the evaluation of medical history-taking skills, a critical component of clinical practice that cannot be assessed using case vignettes," says Shreya Johri, also at Harvard University.

benchmark, diagnosis, simulated patient conversation, (15 more...)

New Scientist

Jan-2-2025, 10:00:04 GMT

News Web Page

Add feedback

Country:
- North America > United States > California (0.06)

Industry:
- Health & Medicine > Diagnostic Medicine (0.32)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.66)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found