Ontology- and LLM-based Data Harmonization for Federated Learning in Healthcare
Kokash, Natallia, Wang, Lei, Gillespie, Thomas H., Belloum, Adam, Grosso, Paola, Quinney, Sara, Li, Lang, de Bono, Bernard
–arXiv.org Artificial Intelligence
The rise of electronic health records (EHRs) has unlocked new opportunities for medical research, but privacy regulations and data heterogeneity remain key barriers to large-scale machine learning. Federated learning (FL) enables collaborative modeling without sharing raw data, yet faces challenges in harmonizing diverse clinical datasets. This paper presents a two-step data alignment strategy integrating ontologies and large language models (LLMs) to support secure, privacy-preserving FL in healthcare, demonstrating its effectiveness in a real-world project involving semantic mapping of EHR data.
arXiv.org Artificial Intelligence
May-27-2025
- Country:
- Asia > India (0.04)
- Europe
- Germany (0.04)
- Netherlands > North Holland
- Amsterdam (0.41)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- North America > United States
- California (0.04)
- Indiana (0.04)
- Ohio (0.04)
- Oklahoma > Payne County
- Cushing (0.04)
- Genre:
- Overview (0.93)
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Industry:
- Health & Medicine
- Consumer Health (1.00)
- Health Care Technology > Medical Record (0.87)
- Pharmaceuticals & Biotechnology (1.00)
- Therapeutic Area (1.00)
- Information Technology > Security & Privacy (1.00)
- Health & Medicine
- Technology: