Dis-AE: Multi-domain & Multi-task Generalisation on Real-World Clinical Data

Daniel Kreuter, Samuel Tull, Julian Gilbey, Jacobus Preller, BloodCounts! Consortium, John A. D. Aston, James H. F. Rudd, Suthesh Sivapalaratnam, Carola-Bibiane Schönlieb, Nicholas Gleadall, Michael Roberts

arXiv.org Artificial Intelligence 

Machine learning has promised to revolutionise healthcare for several years [1, 2]. However, while there is an extensive literature describing high-performing machine learning models trained on immaculate benchmark datasets [3-5], such promising approaches rarely make it into clinical practice [6]. Often, this is because of an unexpected drop in performance when the model is deployed on unseen test data, caused by domain shift [7, 8]: a change in the data distribution between the dataset a model is trained on (source data) and that on which it is deployed (target data). Most common machine learning algorithms rely on the assumption that the source and target data are independent and identically distributed (i.i.d.) [9]. Under domain shift, however, this assumption no longer holds, and model performance can be significantly affected. Domain shift is widespread in medical datasets, resulting from differences in equipment and clinical practice between sites [10-13], and models are vulnerable to basing their predictions on clinically irrelevant features specific to the domain, a failure mode known as shortcut learning [14], which may lead to poor performance on target data. For most medical applications, target data is rarely available prior to real-time deployment; thus, a domain adaptation approach, in which pre-trained models are fine-tuned on data from the target distribution, is not feasible.
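The failure mode described above can be illustrated with a toy NumPy sketch (the synthetic data, features, and thresholds below are illustrative assumptions, not from the paper). A "shortcut" classifier that latches onto a domain-specific artefact scores perfectly on source data but collapses to chance on a target domain where the artefact is no longer correlated with the label, while a classifier using the genuine, noisier signal degrades far less:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 2000

def make_domain(shortcut_correlated):
    """Generate one domain: a genuine noisy signal x and an artefact s."""
    y = rng.integers(0, 2, n)
    x = y + rng.normal(0, 0.8, n)  # weak but clinically genuine signal
    # Domain-specific artefact s (e.g. an equipment offset): in the source
    # domain it happens to track the label; in the target domain it does not.
    s = y.astype(float) if shortcut_correlated else rng.integers(0, 2, n).astype(float)
    return np.column_stack([x, s]), y

X_src, y_src = make_domain(True)    # source: artefact correlated with label
X_tgt, y_tgt = make_domain(False)   # target: correlation broken (domain shift)

# "Shortcut" model: predicts from the artefact feature s alone.
shortcut_pred = lambda X: (X[:, 1] > 0.5).astype(int)
# "Robust" model: predicts from the genuine signal x alone.
robust_pred = lambda X: (X[:, 0] > 0.5).astype(int)

def acc(pred, X, y):
    return (pred(X) == y).mean()

print(f"shortcut model: source {acc(shortcut_pred, X_src, y_src):.2f}, "
      f"target {acc(shortcut_pred, X_tgt, y_tgt):.2f}")
print(f"robust model:   source {acc(robust_pred, X_src, y_src):.2f}, "
      f"target {acc(robust_pred, X_tgt, y_tgt):.2f}")
```

The shortcut model is perfect on the source domain yet near chance on the target domain, mirroring the i.i.d. assumption breaking down, whereas the robust model's accuracy is similar in both domains.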
