Taubmann, Oliver
Handling Label Uncertainty on the Example of Automatic Detection of Shepherd's Crook RCA in Coronary CT Angiography
Denzinger, Felix, Wels, Michael, Taubmann, Oliver, Kordon, Florian, Wagner, Fabian, Mehltretter, Stephanie, Gülsün, Mehmet A., Schöbinger, Max, André, Florian, Buss, Sebastian, Görich, Johannes, Sühling, Michael, Maier, Andreas
Coronary artery disease (CAD) is often treated minimally invasively with a catheter being inserted into the diseased coronary vessel. If a patient exhibits a Shepherd's Crook (SC) Right Coronary Artery (RCA) - an anatomical norm variant of the coronary vasculature - the complexity of this procedure is increased. Automated reporting of this variant from coronary CT angiography screening would ease prior risk assessment. We propose a 1D convolutional neural network which leverages a sequence of residual dilated convolutions to automatically determine this norm variant from a prior extracted vessel centerline. As the SC RCA is not clearly defined with respect to concrete measurements, labeling also includes qualitative aspects. Therefore, 4.23% samples in our dataset of 519 RCA centerlines were labeled as unsure SC RCAs, with 5.97% being labeled as sure SC RCAs. We explore measures to handle this label uncertainty, namely global/model-wise random assignment, exclusion, and soft label assignment. Furthermore, we evaluate how this uncertainty can be leveraged for the determination of a rejection class. With our best configuration, we reach an area under the receiver operating characteristic curve (AUC) of 0.938 on confident labels. Moreover, we observe an increase of up to 0.020 AUC when rejecting 10% of the data and leveraging the labeling uncertainty information in the exclusion process.
CAD-RADS Scoring using Deep Learning and Task-Specific Centerline Labeling
Denzinger, Felix, Wels, Michael, Taubmann, Oliver, Gülsün, Mehmet A., Schöbinger, Max, André, Florian, Buss, Sebastian J., Görich, Johannes, Sühling, Michael, Maier, Andreas, Breininger, Katharina
With coronary artery disease (CAD) persisting to be one of the leading causes of death worldwide, interest in supporting physicians with algorithms to speed up and improve diagnosis is high. In clinical practice, the severeness of CAD is often assessed with a coronary CT angiography (CCTA) scan and manually graded with the CAD-Reporting and Data System (CAD-RADS) score. The clinical questions this score assesses are whether patients have CAD or not (rule-out) and whether they have severe CAD or not (hold-out). In this work, we reach new state-of-the-art performance for automatic CAD-RADS scoring. We propose using severity-based label encoding, test time augmentation (TTA) and model ensembling for a task-specific deep learning architecture. Furthermore, we introduce a novel task- and model-specific, heuristic coronary segment labeling, which subdivides coronary trees into consistent parts across patients. It is fast, robust, and easy to implement. We were able to raise the previously reported area under the receiver operating characteristic curve (AUC) from 0.914 to 0.942 in the rule-out and from 0.921 to 0.950 in the hold-out task respectively.
Explaining Clinical Decision Support Systems in Medical Imaging using Cycle-Consistent Activation Maximization
Katzmann, Alexander, Taubmann, Oliver, Ahmad, Stephen, Mühlberg, Alexander, Sühling, Michael, Groß, Horst-Michael
This includes applications in microscopy and histopathology [1, 2], time-continuous biosignal analysis [3, 4], and, quite prominently, medical image analysis for volumetric imaging data as generated by computed tomography [5, 6], positron emission tomography [7, 8] or magnetic resonance imaging [9, 10, 11]. In the field of medical imaging, recent work has demonstrated a variety of applications for DNNs, such as organ segmentation [12], anomaly detection [13], lesion detection [14], segmentation [15] and assessment [16], providing major advantages and even repeatedly outperforming gold-standard human assessment [17]. A nearby field of similarly growing research interest established with the publications of Kumar et al. and Aerts et al. [18, 19] is,,Radiomics" using traditional machine learning (ML) techniques. Compared to deep learning techniques, traditional ML methods like random forests and support vector machines have a largely transparent decision-making process, which is generally easier to comprehend and/or depict - a clear argument for their preference in clinical practice. Many publications have shown the advantages of DNNs in comparison to traditional machine learning techniques, such as the ability to learn descriptive features from data instead of a complex and expensive handcrafted feature design, as well as an improved classification performance on medical imaging tasks [20, 21], with some architectures being on par with gold-standard human assessment [17]. However, as DNNs learn features from the given data, the semantic of these features is in general not immediately evident. Thus, clinicians understandably approach these methods with a high degree of skepticism.