AI-driven software for automated quantification of skeletal metastases and treatment response evaluation using Whole-Body Diffusion-Weighted MRI (WB-DWI) in Advanced Prostate Cancer
Candito, Antonio, Blackledge, Matthew D, Holbrey, Richard, Porta, Nuria, Ribeiro, Ana, Zugni, Fabio, D'Erme, Luca, Castagnoli, Francesca, Dragan, Alina, Donners, Ricardo, Messiou, Christina, Tunariu, Nina, Koh, Dow-Mu
arXiv.org Artificial Intelligence
Quantitative assessment of treatment response in Advanced Prostate Cancer (APC) with bone metastases remains an unmet clinical need. Whole-Body Diffusion-Weighted MRI (WB-DWI) provides two response biomarkers: Total Diffusion Volume (TDV) and global Apparent Diffusion Coefficient (gADC). However, tracking post-treatment changes in TDV and gADC from manually delineated lesions is cumbersome and increases inter-reader variability. We developed software to automate this process. Core technologies include: (i) a weakly-supervised Residual U-Net model generating a skeleton probability map to isolate bone; (ii) a statistical framework for WB-DWI intensity normalisation, yielding a signal-normalised b = 900 s/mm^2 (b900) image; and (iii) a shallow convolutional neural network that processes the outputs of (i) and (ii) to generate a mask of suspected bone lesions, which are characterised by higher b900 signal intensity due to restricted water diffusion. This mask is applied to the gADC map to extract TDV and gADC statistics. We tested the tool using expert-defined metastatic bone disease delineations on 66 datasets, assessed the repeatability of the imaging biomarkers (N=10), and compared software-based response assessment with a construct reference standard (N=118). The average Dice score between manual and automated delineations was 0.6 for lesions within the pelvis and spine, with an average surface distance of 2 mm. Relative differences for log-transformed TDV (log-TDV) and median gADC were 8.8% and 5%, respectively. Repeatability analysis showed coefficients of variation of 4.6% for log-TDV and 3.5% for median gADC, with intraclass correlation coefficients of 0.94 or higher. The software achieved 80.5% accuracy, 84.3% sensitivity, and 85.7% specificity in assessing response to treatment. Average computation time was 90 s per scan.
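To make the quantities in the abstract concrete, the following is a minimal sketch of how a lesion mask could be turned into TDV and median gADC, and how a Dice score between a manual and an automated delineation could be computed. It is not the authors' implementation. The function names, the `voxel_volume_ml` parameter, the ADC units, and the toy arrays are all assumptions made for illustration.

```python
import numpy as np


def dice_score(mask_a, mask_b):
    """Dice overlap between two binary masks (1.0 if both are empty)."""
    a = np.asarray(mask_a, dtype=bool)
    b = np.asarray(mask_b, dtype=bool)
    denom = a.sum() + b.sum()
    if denom == 0:
        return 1.0
    return 2.0 * np.logical_and(a, b).sum() / denom


def tdv_and_median_gadc(adc_map, lesion_mask, voxel_volume_ml):
    """Return (TDV in mL, median ADC inside the mask).

    Assumption: TDV is the number of masked voxels times the voxel
    volume, and the gADC statistic is the median ADC over those voxels.
    """
    mask = np.asarray(lesion_mask, dtype=bool)
    values = np.asarray(adc_map, dtype=float)[mask]
    tdv_ml = float(mask.sum() * voxel_volume_ml)
    median_adc = float(np.median(values)) if values.size else float("nan")
    return tdv_ml, median_adc


# Toy example (hypothetical data, not from the study).
manual = np.zeros((4, 4, 4), dtype=bool)
manual[1:3, 1:3, 1:3] = True          # 8 voxels
automated = np.zeros((4, 4, 4), dtype=bool)
automated[1:3, 1:3, 1:4] = True       # 12 voxels, 8 shared with manual

adc = np.full((4, 4, 4), 1000.0)      # assumed units: 1e-6 mm^2/s

overlap = dice_score(manual, automated)
tdv_ml, median_adc = tdv_and_median_gadc(adc, automated, voxel_volume_ml=0.5)
print(overlap, tdv_ml, median_adc)
```

In practice the voxel volume would come from the image header, and the mask would be the output of the lesion-detection step rather than a hand-built array.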
Nov-5-2025
- Country:
  - Europe
    - Finland > Uusimaa
      - Helsinki (0.04)
    - Germany (0.04)
    - Italy > Lazio
      - Rome (0.04)
    - Switzerland > Basel-City
      - Basel (0.04)
    - United Kingdom > England
      - Greater London > London (0.04)
- Genre:
  - Research Report
    - Experimental Study (1.00)
    - New Finding (1.00)
- Industry:
  - Health & Medicine
    - Diagnostic Medicine > Imaging (1.00)
    - Pharmaceuticals & Biotechnology (1.00)
    - Therapeutic Area > Oncology
      - Bone Cancer (0.35)
      - Prostate Cancer (0.36)
- Technology: