Evaluation of Large Language Models for Summarization Tasks in the Medical Domain: A Narrative Review

Croxford, Emma, Gao, Yanjun, Pellegrino, Nicholas, Wong, Karen K., Wills, Graham, First, Elliot, Liao, Frank J., Goswami, Cherodeep, Patterson, Brian, Afshar, Majid

Sep-26-2024–arXiv.org Artificial Intelligence

Large Language Models have advanced clinical Natural Language Generation, creating opportunities to manage the volume of medical text. However, the high-stakes nature of medicine requires reliable evaluation, which remains a challenge. In this narrative review, we assess the current evaluation state for clinical summarization tasks and propose future directions to address the resource constraints of expert human evaluation.

aclanthology, computational linguistic, evaluation, (12 more...)

arXiv.org Artificial Intelligence

Sep-26-2024

arXiv.org PDF

Add feedback

Country:
- South America > Colombia
  - Meta Department > Villavicencio (0.04)
- Oceania > Australia
  - Victoria > Melbourne (0.04)
- North America
  - Dominican Republic (0.04)
  - United States
    - Wisconsin > Dane County
      - Madison (0.14)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - Michigan > Washtenaw County
      - Ann Arbor (0.04)
    - Massachusetts
      - Suffolk County > Boston (0.04)
      - Middlesex County > Cambridge (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - Colorado > Adams County
      - Aurora (0.04)
    - California > San Diego County
      - San Diego (0.04)
  - Canada
    - Ontario > Toronto (0.04)
    - British Columbia > Metro Vancouver Regional District
      - Vancouver (0.14)
- Europe
  - Germany > Berlin (0.04)
  - France > Provence-Alpes-Côte d'Azur
    - Bouches-du-Rhône > Marseille (0.04)
  - Denmark > Capital Region
    - Copenhagen (0.04)
  - Portugal > Lisbon
    - Lisbon (0.04)
  - Italy
    - Tuscany > Florence (0.04)
    - Liguria > Genoa (0.04)
    - Trentino-Alto Adige/Südtirol > Trentino Province
      - Trento (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Switzerland > Basel-City
    - Basel (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia
  - South Korea (0.04)
  - Singapore (0.04)
  - China > Hong Kong (0.04)
  - Thailand > Bangkok
    - Bangkok (0.04)
  - Middle East > Qatar
    - Ad-Dawhah > Doha (0.04)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine
  - Diagnostic Medicine (0.46)
  - Health Care Providers & Services (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Generation (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found