RAD: Towards Trustworthy Retrieval-Augmented Multi-modal Clinical Diagnosis
Li, Haolin, Dai, Tianjie, Chen, Zhe, Du, Siyuan, Yao, Jiangchao, Zhang, Ya, Wang, Yanfeng
–arXiv.org Artificial Intelligence
Clinical diagnosis is a highly specialized discipline requiring both domain expertise and strict adherence to rigorous guidelines. While current AI-driven medical research predominantly focuses on knowledge graphs or natural text pretraining paradigms to incorporate medical knowledge, these approaches primarily rely on implicitly encoded knowledge within model parameters, neglecting task-specific knowledge required by diverse downstream tasks. To address this limitation, we propose Retrieval-Augmented Diagnosis (RAD), a novel framework that explicitly injects external knowledge into multimodal models directly on downstream tasks. Specifically, RAD operates through three key mechanisms: retrieval and refinement of disease-centered knowledge from multiple medical sources, a guideline-enhanced contrastive loss that constrains the latent distance between multi-modal features and guideline knowledge, and the dual transformer decoder that employs guidelines as queries to steer cross-modal fusion, aligning the models with clinical diagnostic workflows from guideline acquisition to feature extraction and decision-making. Moreover, recognizing the lack of quantitative evaluation of interpretability for multimodal diagnostic models, we introduce a set of criteria to assess the interpretability from both image and text perspectives. Extensive evaluations across four datasets with different anatomies demonstrate RAD's generalizability, achieving state-of-the-art performance. Furthermore, RAD enables the model to concentrate more precisely on abnormal regions and critical indicators, ensuring evidence-based, trustworthy diagnosis. Our code is available at https://github.com/tdlhl/RAD.
arXiv.org Artificial Intelligence
Dec-12-2025
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Research Report
- Industry:
- Health & Medicine
- Diagnostic Medicine > Imaging (1.00)
- Health Care Technology > Medical Record (1.00)
- Nuclear Medicine (0.93)
- Pharmaceuticals & Biotechnology (0.67)
- Therapeutic Area
- Cardiology/Vascular Diseases (1.00)
- Immunology (0.68)
- Infections and Infectious Diseases (1.00)
- Musculoskeletal (0.67)
- Neurology (1.00)
- Psychiatry/Psychology (1.00)
- Pulmonary/Respiratory Diseases (1.00)
- Health & Medicine
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning > Neural Networks
- Deep Learning (1.00)
- Natural Language
- Chatbot (0.68)
- Large Language Model (1.00)
- Representation & Reasoning > Diagnosis (0.93)
- Vision (1.00)
- Machine Learning > Neural Networks
- Data Science (1.00)
- Sensing and Signal Processing > Image Processing (0.93)
- Artificial Intelligence
- Information Technology