LLM, Reporting In! Medical Information Extraction Across Prompting, Fine-tuning and Post-correction
Belmadani, Ikram, Hashemi, Parisa Nazari, Sebbag, Thomas, Favre, Benoit, Fortier, Guillaume, Quiniou, Solen, Morin, Emmanuel, Dufour, Richard
–arXiv.org Artificial Intelligence
This work presents our participation in the EvalLLM 2025 challenge on biomedical Named Entity Recognition (NER) and health event extraction in French (few-shot setting). For NER, we propose three approaches combining large language models (LLMs), annotation guidelines, synthetic data, and post-processing: (1) in-context learning (ICL) with GPT-4.1, incorporating automatic selection of 10 examples and a summary of the annotation guidelines into the prompt, (2) the universal NER system GLiNER, fine-tuned on a synthetic corpus and then verified by an LLM in post-processing, and (3) the open LLM LLaMA-3.1-8B-Instruct, fine-tuned on the same synthetic corpus. Event extraction uses the same ICL strategy with GPT-4.1, reusing the guideline summary in the prompt. Results show GPT-4.1 leads with a macro-F1 of 61.53% for NER and 15.02% for event extraction, highlighting the importance of well-crafted prompting to maximize performance in very low-resource scenarios.
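The ICL prompt described above (selected examples plus a guideline summary) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say how the 10 examples are selected, so token-overlap (Jaccard) similarity is used here as a stand-in, and the function names, the prompt layout, and the `text`/`entities` fields are all hypothetical.

```python
import re


def tokens(text):
    """Lowercased set of word tokens (\\w+ matches accented letters in Python 3)."""
    return set(re.findall(r"\w+", text.lower()))


def jaccard(a, b):
    """Token-overlap similarity between two strings, in [0, 1]."""
    ta, tb = tokens(a), tokens(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def select_examples(query, pool, k=10):
    """Return the k annotated examples most similar to the query.

    Assumption: similarity is Jaccard overlap; the paper's actual
    selection criterion is not given in the abstract.
    """
    ranked = sorted(pool, key=lambda ex: jaccard(query, ex["text"]), reverse=True)
    return ranked[:k]


def build_prompt(query, pool, guideline_summary, k=10):
    """Assemble a few-shot prompt: guideline summary, examples, then the query."""
    parts = ["Annotation guidelines (summary):", guideline_summary, ""]
    for ex in select_examples(query, pool, k):
        parts += [f"Text: {ex['text']}", f"Entities: {ex['entities']}", ""]
    parts += [f"Text: {query}", "Entities:"]
    return "\n".join(parts)
```

The resulting string would then be sent to the chosen LLM; the call itself is omitted because it depends on the provider and client library.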
Oct-7-2025
- Country:
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
- Europe > France > Pays de la Loire > Loire-Atlantique > Nantes (0.05)
- Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
- Europe > France > Île-de-France (0.04)
- Europe > Italy > Piedmont > Turin Province > Turin (0.04)
- North America > Canada > Ontario > Toronto (0.04)
- North America > Mexico > Mexico City > Mexico City (0.04)
- North America > United States > New Mexico > Santa Fe County > Santa Fe (0.04)
- North America > United States > Washington > King County > Seattle (0.04)
- Genre:
- Research Report (0.70)