A methodology for clinically driven interactive segmentation evaluation
Esmaeili, Parhom, Fernandez, Virginia, Borges, Pedro, Gibson, Eli, Ourselin, Sebastien, Cardoso, M. Jorge
–arXiv.org Artificial Intelligence
Interactive segmentation is a promising strategy for building robust, generalisable algorithms for volumetric medical image segmentation. However, inconsistent and clinically unrealistic evaluation hinders fair comparison and misrepresents real-world performance. We propose a clinically grounded methodology for defining evaluation tasks and metrics, and built a software framework for constructing standardised evaluation pipelines. We evaluate state-of-the-art algorithms across heterogeneous and complex tasks and observe that (i) minimising information loss when processing user interactions is critical for model robustness, (ii) adaptive-zooming mechanisms boost robustness and speed convergence, (iii) performance drops if validation prompting behaviour/budgets differ from training, (iv) 2D methods perform well with slab-like images and coarse targets, but 3D context helps with large or irregularly shaped targets, (v) performance of non-medical-domain models (e.g. SAM2) degrades with poor contrast and complex shapes.
arXiv.org Artificial Intelligence
Oct-13-2025
- Country:
- North America > United States (0.28)
- Genre:
- Research Report (0.40)
- Industry:
- Health & Medicine > Diagnostic Medicine > Imaging (1.00)
- Technology: