A Survey of Spanish Clinical Language Models
Subies, Guillem García, Jiménez, Álvaro Barbero, Fernández, Paloma Martínez
–arXiv.org Artificial Intelligence
This survey focuses in encoder Language Models for solving tasks in the clinical domain in the Spanish language. We review the contributions of 17 corpora focused mainly in clinical tasks, then list the most relevant Spanish Language Models and Spanish Clinical Language models. We perform a thorough comparison of these models by benchmarking them over a curated subset of the available corpora, in order to find the best-performing ones; in total more than 3000 models were fine-tuned for this study. All the tested corpora and the best models are made publically available in an accessible way, so that the results can be reproduced by independent teams or challenged in the future when new Spanish Clinical Language models are created.
arXiv.org Artificial Intelligence
Aug-4-2023
- Country:
- South America > Chile
- North America
- Montserrat (0.04)
- Dominican Republic (0.04)
- United States
- Washington > King County
- Seattle (0.04)
- Texas > Dallas County
- Dallas (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Washington > King County
- Europe
- Spain
- Galicia > Madrid (0.04)
- Valencian Community > Valencia Province
- Valencia (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- France > Île-de-France
- Spain
- Asia > Middle East
- Jordan (0.04)
- Genre:
- Research Report (1.00)
- Overview (1.00)
- Industry:
- Information Technology > Security & Privacy (0.68)
- Health & Medicine
- Therapeutic Area (1.00)
- Diagnostic Medicine (0.93)
- Health Care Technology > Medical Record (0.46)
- Technology: