Non-Imaging Medical Data Synthesis for Trustworthy AI: A Comprehensive Survey
Xing, Xiaodan, Wu, Huanjun, Wang, Lichao, Stenson, Iain, Yong, May, Del Ser, Javier, Walsh, Simon, Yang, Guang
–arXiv.org Artificial Intelligence
Data quality is the key factor for the development of trustworthy AI in healthcare. A large volume of curated datasets with controlled confounding factors can help improve the accuracy, robustness and privacy of downstream AI algorithms. However, access to good quality datasets is limited by the technical difficulty of data acquisition and large-scale sharing of healthcare data is hindered by strict ethical restrictions. Data synthesis algorithms, which generate data with a similar distribution as real clinical data, can serve as a potential solution to address the scarcity of good quality data during the development of trustworthy AI. However, state-of-the-art data synthesis algorithms, especially deep learning algorithms, focus more on imaging data while neglecting the synthesis of non-imaging healthcare data, including clinical measurements, medical signals and waveforms, and electronic healthcare records (EHRs). Thus, in this paper, we will review the synthesis algorithms, particularly for non-imaging medical data, with the aim of providing trustworthy AI in this domain. This tutorial-styled review paper will provide comprehensive descriptions of non-imaging medical data synthesis on aspects including algorithms, evaluations, limitations and future research directions.
arXiv.org Artificial Intelligence
Sep-17-2022
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.14)
- North America
- United States
- Wisconsin (0.04)
- Hawaii (0.04)
- Indiana > Marion County
- Indianapolis (0.04)
- Texas
- Travis County > Austin (0.04)
- Dallas County > Dallas (0.04)
- Maryland > Montgomery County
- Bethesda (0.04)
- Massachusetts > Suffolk County
- Boston (0.14)
- Rhode Island > Providence County
- Providence (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Washington > King County
- Seattle (0.04)
- Tennessee > Shelby County
- Memphis (0.04)
- California
- Santa Clara County > San Jose (0.04)
- Los Angeles County
- Los Angeles (0.14)
- Long Beach (0.04)
- New York > New York County
- New York City (0.04)
- Canada
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- United States
- Europe
- Austria > Vienna (0.14)
- Switzerland (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Spain
- Valencian Community > Valencia Province
- Valencia (0.04)
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Valencian Community > Valencia Province
- United Kingdom > England
- Greater London > London (0.04)
- Italy > Veneto
- Venice (0.04)
- Belgium > Flanders
- Antwerp Province > Antwerp (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Asia
- Taiwan > Taiwan Province
- Taipei (0.04)
- South Korea > Busan
- Busan (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Japan > Honshū
- Kantō > Kanagawa Prefecture > Yokohama (0.04)
- China
- Zhejiang Province > Hangzhou (0.04)
- Hong Kong (0.04)
- Taiwan > Taiwan Province
- Oceania > Australia
- Genre:
- Research Report > Experimental Study (1.00)
- Overview (1.00)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Health & Medicine
- Government Relations & Public Policy (1.00)
- Consumer Health (1.00)
- Health Care Technology > Medical Record (0.93)
- Health Care Providers & Services > Reimbursement (0.92)
- Diagnostic Medicine > Imaging (0.67)
- Therapeutic Area
- Oncology (1.00)
- Neurology (1.00)
- Cardiology/Vascular Diseases (1.00)
- Endocrinology (0.67)
- Government > Regional Government
- Technology:
- Information Technology
- Data Science > Data Mining (1.00)
- Artificial Intelligence
- Representation & Reasoning > Uncertainty (1.00)
- Issues > Social & Ethical Issues (1.00)
- Machine Learning
- Statistical Learning (1.00)
- Performance Analysis > Accuracy (1.00)
- Neural Networks > Deep Learning (1.00)
- Learning Graphical Models > Directed Networks
- Bayesian Learning (1.00)
- Information Technology