Data-Centric Foundation Models in Computational Healthcare: A Survey

Zhang, Yunkun; Gao, Jin; Tan, Zheling; Zhou, Lingfeng; Ding, Kexin; Zhou, Mu; Zhang, Shaoting; Wang, Dequan

Jan-4-2024, arXiv.org Artificial Intelligence

In computational healthcare [3, 72], foundation models (FMs) can handle a wide variety of clinical data thanks to their capabilities in logical reasoning and semantic understanding. Applications include medical conversation [241, 316], patient health profiling [48], and treatment planning [192]. Moreover, given their strength in large-scale data processing, FMs offer a paradigm shift toward assessing real-world clinical data in the healthcare workflow rapidly and effectively [208, 261]. FM research places a sharp focus on the data-centric perspective [318], for two reasons. First, FMs demonstrate the power of scale: larger models and datasets let FMs capture vast amounts of information, which in turn heightens the need for large quantities of training data [272]. Second, FMs encourage homogenization [21], as evidenced by their broad adaptability to downstream tasks. High-quality training data therefore becomes critical, since it affects the performance of both the pre-trained FM and the downstream models adapted from it. Addressing key data challenges is thus increasingly recognized as a research priority.

bioinformatics, large language model, machine learning

Country:
- Asia > Middle East
  - Republic of Türkiye (0.14)
- Europe (1.00)
- North America > United States
  - North Carolina (0.14)

Genre:
- Research Report
  - Experimental Study (1.00)
  - New Finding (1.00)

Industry:
- Health & Medicine
  - Consumer Health (1.00)
  - Diagnostic Medicine > Imaging (1.00)
  - Health Care Technology (1.00)
  - Nuclear Medicine (0.94)
  - Pharmaceuticals & Biotechnology (1.00)
  - Therapeutic Area
    - Cardiology/Vascular Diseases (1.00)
    - Neurology (1.00)
    - Oncology (1.00)
- Information Technology > Security & Privacy (1.00)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning > Neural Networks
      - Deep Learning > Generative AI (0.67)
    - Natural Language
      - Chatbot (1.00)
      - Large Language Model (1.00)
    - Representation & Reasoning > Information Fusion (0.68)
    - Vision > Image Understanding (0.67)
  - Biomedical Informatics > Clinical Informatics (0.87)
  - Communications > Social Media (0.92)
  - Data Science > Data Mining (1.00)
  - Information Management (1.00)
  - Security & Privacy (1.00)
  - Sensing and Signal Processing > Image Processing (1.00)