Collaborating Authors

 Liao, Frank J.


Evaluation of Large Language Models for Summarization Tasks in the Medical Domain: A Narrative Review

arXiv.org Artificial Intelligence

Large Language Models have advanced clinical Natural Language Generation, creating opportunities to manage the volume of medical text. However, the high-stakes nature of medicine requires reliable evaluation, which remains a challenge. In this narrative review, we assess the current state of evaluation for clinical summarization tasks and propose future directions to address the resource constraints of expert human evaluation.