Potential limitations in COVID-19 machine learning due to data source variability: A case study in the nCov2019 dataset

Nov-16-2020, 13:22:07 GMT–#artificialintelligence

The lack of representative coronavirus disease 2019 (COVID-19) data is a bottleneck for reliable and generalizable machine learning. Data sharing is insufficient without data quality, in which source variability plays an important role. We showcase and discuss potential biases from data source variability for COVID-19 machine learning. We used the publicly available nCov2019 dataset, including patient-level data from several countries. We aimed to the discovery and classification of severity subgroups using symptoms and comorbidities. Cases from the 2 countries with the highest prevalence were divided into separate subgroups with distinct severity manifestations.

comorbidity, subgroup, variability, (15 more...)

#artificialintelligence

Nov-16-2020, 13:22:07 GMT

News Web Page

Add feedback

Country:
- North America > United States (0.04)
- Europe
  - United Kingdom (0.04)
  - Italy (0.04)
- Asia
  - Philippines (0.08)
  - China (0.07)

Genre:
- Research Report
  - Experimental Study (0.68)
  - New Finding (0.47)

Industry:
- Health & Medicine > Therapeutic Area
  - Infections and Infectious Diseases (1.00)
  - Immunology (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found