LLM-based Prompt Ensemble for Reliable Medical Entity Recognition from EHRs

Islam, K M Sajjadul, Nipu, Ayesha Siddika, Wu, Jiawei, Madiraju, Praveen

May-27-2025–arXiv.org Artificial Intelligence

--Electronic Health Records (EHRs) are digital records of patient information, often containing unstructured clinical text. Named Entity Recognition (NER) is essential in EHRs for extracting key medical entities like problems, tests, and treatments to support downstream clinical applications. This paper explores prompt-based medical entity recognition using large language models (LLMs), specifically GPT -4o and DeepSeek-R1, guided by various prompt engineering techniques, including zero-shot, few-shot, and an ensemble approach. Among all strategies, GPT -4o with prompt ensemble achieved the highest classification performance with an F1-score of 0.95 and recall of 0.98, outperforming DeepSeek-R1 on the task. The ensemble method improved reliability by aggregating outputs through embedding-based similarity and majority voting. Electronic Health Records (EHRs) are digital systems that store and manage patient information across clinical encounters. Initially designed to support administrative tasks like billing and scheduling, EHRs have now become essential tools in healthcare delivery and research. Over the past two decades, adoption of EHRs has grown rapidly, especially in the United States [1].

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

May-27-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States > Wisconsin (0.14)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Health & Medicine > Health Care Technology > Medical Record (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found