UTSA-NLP at ArchEHR-QA 2025: Improving EHR Question Answering via Self-Consistency Prompting
Shields-Menard, Sara, Reimers, Zach, Gardner, Joshua, Perry, David, Rios, Anthony
–arXiv.org Artificial Intelligence
We describe our system for the ArchEHR-QA Shared Task on answering clinical questions using electronic health records (EHRs). Our approach uses large language models in two steps: first, to find sentences in the EHR relevant to a clinician's question, and second, to generate a short, citation-supported response based on those sentences. We use few-shot prompting, self-consistency, and thresholding to improve the sentence classification step to decide which sentences are essential. We compare several models and find that a smaller 8B model performs better than a larger 70B model for identifying relevant information. Our results show that accurate sentence selection is critical for generating high-quality responses and that self-consistency with thresholding helps make these decisions more reliable.
arXiv.org Artificial Intelligence
Jun-9-2025
- Country:
- Europe > Austria
- Vienna (0.14)
- North America > United States
- Texas (0.04)
- Europe > Austria
- Genre:
- Research Report > New Finding (0.68)
- Industry:
- Technology: