LLM-based Prompt Ensemble for Reliable Medical Entity Recognition from EHRs
Islam, K M Sajjadul, Nipu, Ayesha Siddika, Wu, Jiawei, Madiraju, Praveen
–arXiv.org Artificial Intelligence
--Electronic Health Records (EHRs) are digital records of patient information, often containing unstructured clinical text. Named Entity Recognition (NER) is essential in EHRs for extracting key medical entities like problems, tests, and treatments to support downstream clinical applications. This paper explores prompt-based medical entity recognition using large language models (LLMs), specifically GPT -4o and DeepSeek-R1, guided by various prompt engineering techniques, including zero-shot, few-shot, and an ensemble approach. Among all strategies, GPT -4o with prompt ensemble achieved the highest classification performance with an F1-score of 0.95 and recall of 0.98, outperforming DeepSeek-R1 on the task. The ensemble method improved reliability by aggregating outputs through embedding-based similarity and majority voting. Electronic Health Records (EHRs) are digital systems that store and manage patient information across clinical encounters. Initially designed to support administrative tasks like billing and scheduling, EHRs have now become essential tools in healthcare delivery and research. Over the past two decades, adoption of EHRs has grown rapidly, especially in the United States [1].
arXiv.org Artificial Intelligence
May-27-2025
- Country:
- North America > United States > Wisconsin (0.14)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Technology: