Are Clinical T5 Models Better for Clinical Text?

Yahan Li, Keith Harrigian, Ayah Zirikly, Mark Dredze

arXiv.org Artificial Intelligence, Dec-8-2024

Large language models with a transformer-based encoder/decoder architecture, such as T5, have become standard platforms for supervised tasks. To bring these technologies to the clinical domain, recent work has trained new models on clinical data or adapted existing models to it. However, evaluation of these clinical T5 models, and comparison with other models, has been limited. Are the clinical T5 models better choices than FLAN-tuned generic T5 models? Do they generalize better to new clinical domains that differ from their training sets? We comprehensively evaluate these models across several clinical tasks and domains. We find that clinical T5 models provide only marginal improvements over existing models and perform worse when evaluated on different domains. Our results inform future choices in developing clinical LLMs.

large language model, machine learning, natural language

Country:
- North America
  - United States > California (0.14)
  - Canada > Ontario
    - Toronto (0.04)
- Europe > France
  - Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
- Asia > Middle East
  - Jordan (0.04)
  - UAE > Abu Dhabi Emirate
    - Abu Dhabi (0.04)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Industry:
- Information Technology > Security & Privacy (0.92)
- Health & Medicine
  - Therapeutic Area (0.92)
  - Health Care Technology > Medical Record (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.87)
