RadEdit: stress-testing biomedical vision models via diffusion image editing

Pérez-García, Fernando, Bond-Taylor, Sam, Sanchez, Pedro P., van Breugel, Boris, Castro, Daniel C., Sharma, Harshita, Salvatelli, Valentina, Wetscherek, Maria T. A., Richardson, Hannah, Lungren, Matthew P., Nori, Aditya, Alvarez-Valle, Javier, Oktay, Ozan, Ilse, Maximilian

Dec-21-2023–arXiv.org Artificial Intelligence

Biomedical imaging datasets are often small and biased, meaning that real-world performance of predictive models can be substantially lower than expected from internal testing. This work proposes using generative image editing to simulate dataset shifts and diagnose failure modes of biomedical vision models; this can be used in advance of deployment to assess readiness, potentially reducing cost and patient harm. Existing editing methods can produce undesirable changes, with spurious correlations learned due to the co-occurrence of disease and treatment interventions, limiting practical applicability. To address this, we train a text-to-image diffusion model on multiple chest X-ray datasets and introduce a new editing method RadEdit that uses multiple masks, if present, to constrain changes and ensure consistency in the edited images. We consider three types of dataset shifts: acquisition shift, manifestation shift, and population shift, and demonstrate that our approach can diagnose failures and quantify model robustness without additional data collection, complementing more qualitative tools for explainable AI.

dataset, editing, radedit, (14 more...)

arXiv.org Artificial Intelligence

Dec-21-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - California (0.04)
  - New Mexico > Bernalillo County
    - Albuquerque (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Germany > Bavaria
    - Upper Bavaria > Munich (0.04)

Genre:
- Research Report > Experimental Study (0.34)

Industry:
- Health & Medicine
  - Diagnostic Medicine > Imaging (1.00)
  - Nuclear Medicine (0.94)
  - Therapeutic Area > Pulmonary/Respiratory Diseases (0.93)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Natural Language (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.68)