An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing

Peng, Le, Luo, Gaoxiang, zhou, sicheng, chen, jiandong, Zhang, Rui, Xu, Ziyue, Sun, Ju

Nov-11-2023–arXiv.org Artificial Intelligence

Language models (LMs) such as BERT and GPT have revolutionized natural language processing (NLP). However, the medical field faces challenges in training LMs due to limited data access and privacy constraints imposed by regulations like the Health Insurance Portability and Accountability Act (HIPPA) and the General Data Protection Regulation (GDPR). Federated learning (FL) offers a decentralized solution that enables collaborative learning while ensuring data privacy. In this study, we evaluated FL on 2 biomedical NLP tasks encompassing 8 corpora using 6 LMs. Our results show that: 1) FL models consistently outperformed models trained on individual clients' data and sometimes performed comparably with models trained with polled data; 2) with the fixed number of total data, FL models training with more clients produced inferior performance but pre-trained transformer-based models exhibited great resilience.

dataset, learning, preprint, (15 more...)

arXiv.org Artificial Intelligence

Nov-11-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Pennsylvania > Philadelphia County
    - Philadelphia (0.14)
  - Minnesota > Hennepin County
    - Minneapolis (0.29)
  - California > Santa Clara County
    - Santa Clara (0.04)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Information Technology > Security & Privacy (1.00)
- Health & Medicine > Health Care Technology
  - Medical Record (0.47)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (0.92)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found