FedPID: An Aggregation Method for Federated Learning
Mächler, Leon, Grimberg, Gustav, Ezhov, Ivan, Nickel, Manuel, Shit, Suprosanna, Naccache, David, Paetzold, Johannes C.
This paper presents FedPID, our submission to the Federated Tumor Segmentation Challenge 2024 (FETS24). Inspired by FedCostWAvg and FedPIDAvg, our winning contributions to FETS21 and FETS22, we propose an improved aggregation strategy for federated and collaborative learning. FedCostWAvg is a method that averages results by considering both the number of training samples in each group and how much the cost function decreased in the last round of training, similar to how the derivative part of a PID controller works. In FedPIDAvg, we also included the missing integral part. Another challenge we faced was the vastly differing dataset size at each center. We solved this by assuming the sizes follow a Poisson distribution and adjusting the training iterations for each center accordingly. Essentially, this part of the method ensures that outliers requiring excessive training time are used less frequently. Building on these contributions, we now adapt FedPIDAvg by changing how the integral part is computed.
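To make the controller analogy concrete, here is a minimal sketch of a PID-style aggregation rule, assuming the server has per-center sample counts and per-round cost histories. The coefficients k_p, k_i, k_d, the function names, and the exact form of the P, I, and D terms are illustrative assumptions, not the published FedPID formula.

```python
import numpy as np

def fedpid_weights(n_samples, cost_history, k_p=0.45, k_i=0.1, k_d=0.45):
    """Illustrative PID-style aggregation weights for federated averaging.

    n_samples:    number of training samples per center
    cost_history: per-center list of cost values, one per federated round
    The P/I/D terms below only sketch the idea of combining sample counts,
    accumulated costs, and the most recent cost drop.
    """
    n = np.asarray(n_samples, dtype=float)
    p_term = n / n.sum()                                   # proportional: dataset size

    drops = np.array([max(c[-2] - c[-1], 0.0) if len(c) > 1 else 0.0
                      for c in cost_history])              # derivative: last cost drop
    d_term = drops / drops.sum() if drops.sum() > 0 else p_term

    integrals = np.array([sum(c) for c in cost_history])   # integral: accumulated cost
    i_term = integrals / integrals.sum() if integrals.sum() > 0 else p_term

    w = k_p * p_term + k_i * i_term + k_d * d_term
    return w / w.sum()

def aggregate(client_params, weights):
    # Weighted average of per-center parameter arrays of identical shape.
    return sum(w * p for w, p in zip(weights, client_params))
```

In practice, the server would recompute these weights at every federated round before averaging the received model updates.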
Whole Heart 3D+T Representation Learning Through Sparse 2D Cardiac MR Images
Zhang, Yundi, Chen, Chen, Shit, Suprosanna, Starck, Sophie, Rueckert, Daniel, Pan, Jiazhen
Cardiac Magnetic Resonance (CMR) imaging serves as the gold standard for evaluating cardiac morphology and function. Typically, a multi-view CMR stack, covering short-axis (SA) and 2/3/4-chamber long-axis (LA) views, is acquired for a thorough cardiac assessment. However, efficiently streamlining the complex, high-dimensional 3D+T CMR data and distilling a compact, coherent representation remains a challenge. In this work, we introduce a whole-heart self-supervised learning framework that utilizes masked image modeling to automatically uncover the correlations between spatial and temporal patches throughout the cardiac stacks. This process facilitates the generation of meaningful and well-clustered heart representations without relying on the traditionally required, and often costly, labeled data. The learned heart representation can be directly used for various downstream tasks. Furthermore, our method demonstrates remarkable robustness, ensuring consistent representations even when certain CMR planes are missing or flawed. We train our model on 14,000 unlabeled CMR stacks from the UK Biobank and evaluate it on 1,000 annotated ones. The proposed method demonstrates superior performance to baselines in tasks that demand comprehensive 3D+T cardiac information, e.g.
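As a rough illustration of the masked-modeling idea, the sketch below randomly hides a fraction of spatio-temporal patches from a CMR stack so that only the visible tokens reach the encoder. The patch size, mask ratio, and tensor layout are assumptions for illustration, not the paper's configuration.

```python
import torch

def mask_spatiotemporal_patches(x, patch=16, mask_ratio=0.75):
    """Split a (B, T, H, W) CMR stack into non-overlapping patches and keep a
    random subset; the hidden patches would be reconstructed during training.
    Patch size and mask ratio are illustrative choices."""
    B, T, H, W = x.shape
    tokens = x.unfold(2, patch, patch).unfold(3, patch, patch)   # B, T, H/p, W/p, p, p
    tokens = tokens.reshape(B, -1, patch * patch)                # B, N, p*p
    N = tokens.shape[1]
    keep = int(N * (1 - mask_ratio))
    idx = torch.argsort(torch.rand(B, N), dim=1)[:, :keep]       # random tokens per sample
    visible = torch.gather(tokens, 1, idx.unsqueeze(-1).expand(-1, -1, patch * patch))
    return visible, idx   # the encoder sees only the visible tokens
```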
Enhancing Interpretability of Vertebrae Fracture Grading using Human-interpretable Prototypes
Sinhamahapatra, Poulami, Shit, Suprosanna, Sekuboyina, Anjany, Husseini, Malek, Schinz, David, Lenhart, Nicolas, Menze, Joern, Kirschke, Jan, Roscher, Karsten, Guennemann, Stephan
Vertebral fracture grading classifies the severity of vertebral fractures, which is a challenging task in medical imaging and has recently been addressed with Deep Learning (DL) models. Only a few works have attempted to make such models human-interpretable, despite the need for transparency and trustworthiness in critical use cases like DL-assisted medical diagnosis. Moreover, such models either rely on post-hoc methods or additional annotations. In this work, we propose a novel interpretable-by-design method, ProtoVerse, to find relevant sub-parts of vertebral fractures (prototypes) that reliably explain the model's decisions in a human-understandable way. Specifically, we introduce a novel diversity-promoting loss to mitigate prototype repetition in small datasets with intricate semantics. We have experimented with the VerSe'19 dataset and outperformed the existing prototype-based method. Furthermore, our model provides superior interpretability compared to the post-hoc method.
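One generic way to implement a diversity-promoting term is to penalize pairwise similarity between prototype vectors; the sketch below does this with cosine similarity and a margin. Both the margin and the functional form are assumptions, not necessarily the ProtoVerse loss.

```python
import torch
import torch.nn.functional as F

def prototype_diversity_loss(prototypes, margin=0.3):
    """Penalize prototype pairs whose cosine similarity exceeds `margin`,
    discouraging repeated (near-duplicate) prototypes.
    prototypes: (K, D) tensor of learnable prototype vectors."""
    p = F.normalize(prototypes, dim=1)
    sim = p @ p.t()                                                  # (K, K) cosine similarities
    K = sim.shape[0]
    off_diag = sim[~torch.eye(K, dtype=torch.bool, device=sim.device)]  # drop self-similarities
    return F.relu(off_diag - margin).mean()
```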
Topologically faithful multi-class segmentation in medical images
Berger, Alexander H., Stucki, Nico, Lux, Laurin, Buergin, Vincent, Shit, Suprosanna, Banaszak, Anna, Rueckert, Daniel, Bauer, Ulrich, Paetzold, Johannes C.
Topological accuracy in medical image segmentation is a highly important property for downstream applications such as network analysis and flow modeling in vessels or cell counting. Recently, significant methodological advancements have brought well-founded concepts from algebraic topology to binary segmentation. However, these approaches have been underexplored in multi-class segmentation scenarios, where topological errors are common. We propose a general loss function for topologically faithful multi-class segmentation that extends the recent Betti matching concept, which is based on induced matchings of persistence barcodes. We project the N-class segmentation problem onto N single-class segmentation tasks, which allows us to use 1-parameter persistent homology and makes the training of neural networks computationally feasible. We validate our method on a comprehensive set of four medical datasets with highly varied topological characteristics. Our loss formulation significantly enhances topological correctness in cardiac, cell, artery-vein, and Circle of Willis segmentation.
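The one-vs-rest projection described above can be written in a few lines: the N-class prediction is split into N binary problems, and a binary topology-aware loss (for example a Betti-matching loss, not implemented here) is applied per class. The function names and interfaces are assumptions for illustration.

```python
import torch

def one_vs_rest_tasks(probs, target):
    """Project an N-class problem onto N single-class tasks.
    probs:  (B, N, H, W) softmax probabilities
    target: (B, H, W) integer labels in [0, N)"""
    for c in range(probs.shape[1]):
        yield probs[:, c], (target == c).float()

def multiclass_topo_loss(probs, target, binary_topo_loss):
    # `binary_topo_loss` maps a (prediction, binary mask) pair to a scalar;
    # the topology-aware part itself is assumed to exist elsewhere.
    return sum(binary_topo_loss(p, m) for p, m in one_vs_rest_tasks(probs, target))
```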
Cross-domain and Cross-dimension Learning for Image-to-Graph Transformers
Berger, Alexander H., Lux, Laurin, Shit, Suprosanna, Ezhov, Ivan, Kaissis, Georgios, Menten, Martin J., Rueckert, Daniel, Paetzold, Johannes C.
Direct image-to-graph transformation is a challenging task that solves object detection and relationship prediction in a single model. Due to the complexity of this task, large training datasets are rare in many domains, which makes training large networks difficult. This data sparsity necessitates the establishment of pre-training strategies akin to the state-of-the-art in computer vision. In this work, we introduce a set of methods enabling cross-domain and cross-dimension transfer learning for image-to-graph transformers. We propose (1) a regularized edge sampling loss for sampling the optimal number of object relationships (edges) across domains, (2) a domain adaptation framework for image-to-graph transformers that aligns features from different domains, and (3) a simple projection function that allows us to pretrain 3D transformers on 2D input data. We demonstrate our method's utility in cross-domain and cross-dimension experiments, where we pretrain our models on 2D satellite images before applying them to vastly different target domains in 2D and 3D. Our method consistently outperforms a series of baselines on challenging benchmarks, such as retinal or whole-brain vessel graph extraction.
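Of the three components, the 2D-to-3D projection is the easiest to illustrate: the sketch below simply embeds a 2D image as a thin 3D volume so it can pass through a 3D pipeline. This is only one plausible reading of such a projection and is not claimed to be the paper's function.

```python
import torch

def lift_2d_to_3d(img2d, depth=1):
    """Embed a (B, C, H, W) 2D batch as a (B, C, depth, H, W) volume by
    repeating the image along the new depth axis. `depth` is an illustrative
    parameter; depth=1 yields a single-slice pseudo-volume."""
    return img2d.unsqueeze(2).repeat(1, 1, depth, 1, 1)
```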
TopCoW: Benchmarking Topology-Aware Anatomical Segmentation of the Circle of Willis (CoW) for CTA and MRA
Yang, Kaiyuan, Musio, Fabio, Ma, Yihui, Juchler, Norman, Paetzold, Johannes C., Al-Maskari, Rami, Höher, Luciano, Li, Hongwei Bran, Hamamci, Ibrahim Ethem, Sekuboyina, Anjany, Shit, Suprosanna, Huang, Houjing, Waldmannstetter, Diana, Kofler, Florian, Navarro, Fernando, Menten, Martin, Ezhov, Ivan, Rueckert, Daniel, Vos, Iris, Ruigrok, Ynte, Velthuis, Birgitta, Kuijf, Hugo, Hämmerli, Julien, Wurster, Catherine, Bijlenga, Philippe, Westphal, Laura, Bisschop, Jeroen, Colombo, Elisa, Baazaoui, Hakim, Makmur, Andrew, Hallinan, James, Wiestler, Bene, Kirschke, Jan S., Wiest, Roland, Montagnon, Emmanuel, Letourneau-Guillon, Laurent, Galdran, Adrian, Galati, Francesco, Falcetta, Daniele, Zuluaga, Maria A., Lin, Chaolong, Zhao, Haoran, Zhang, Zehan, Ra, Sinyoung, Hwang, Jongyun, Park, Hyunjin, Chen, Junqiang, Wodzinski, Marek, Müller, Henning, Shi, Pengcheng, Liu, Wei, Ma, Ting, Yalçin, Cansu, Hamadache, Rachika E., Salvi, Joaquim, Llado, Xavier, Estrada, Uma Maria Lal-Trehan, Abramova, Valeriia, Giancardo, Luca, Oliver, Arnau, Liu, Jialu, Huang, Haibin, Cui, Yue, Lin, Zehang, Liu, Yusheng, Zhu, Shunzhi, Patel, Tatsat R., Tutino, Vincent M., Orouskhani, Maysam, Wang, Huayu, Mossa-Basha, Mahmud, Zhu, Chengcheng, Rokuss, Maximilian R., Kirchhoff, Yannick, Disch, Nico, Holzschuh, Julius, Isensee, Fabian, Maier-Hein, Klaus, Sato, Yuki, Hirsch, Sven, Wegener, Susanne, Menze, Bjoern
The Circle of Willis (CoW) is an important network of arteries connecting the major circulations of the brain. Its vascular architecture is believed to affect the risk, severity, and clinical outcome of serious neuro-vascular diseases. However, characterizing the highly variable CoW anatomy is still a manual and time-consuming expert task. The CoW is usually imaged by two angiographic imaging modalities, magnetic resonance angiography (MRA) and computed tomography angiography (CTA), but there are only limited public datasets with annotations of CoW anatomy, especially for CTA. Therefore, we organized the TopCoW Challenge in 2023 and released an annotated CoW dataset. The TopCoW dataset was the first public dataset with voxel-level annotations for thirteen possible CoW vessel components, enabled by virtual-reality (VR) technology. It was also the first large dataset with paired MRA and CTA from the same patients. The TopCoW challenge formalized the CoW characterization problem as a multiclass anatomical segmentation task with an emphasis on topological metrics. We invited submissions worldwide for the CoW segmentation task, which attracted over 140 registered participants from four continents. The top-performing teams managed to segment many CoW components with Dice scores around 90%, but with lower scores for communicating arteries and rare variants. There were also topological mistakes in predictions with high Dice scores. Additional topological analysis revealed further areas for improvement in detecting certain CoW components and matching CoW variant topology accurately. TopCoW represented a first attempt at benchmarking the CoW anatomical segmentation task for MRA and CTA, both morphologically and topologically.
Panoptica -- instance-wise evaluation of 3D semantic and instance segmentation maps
Kofler, Florian, Möller, Hendrik, Buchner, Josef A., de la Rosa, Ezequiel, Ezhov, Ivan, Rosier, Marcel, Mekki, Isra, Shit, Suprosanna, Negwer, Moritz, Al-Maskari, Rami, Ertürk, Ali, Vinayahalingam, Shankeeth, Isensee, Fabian, Pati, Sarthak, Rueckert, Daniel, Kirschke, Jan S., Ehrlich, Stefan K., Reinke, Annika, Menze, Bjoern, Wiestler, Benedikt, Piraud, Marie
This paper introduces panoptica, a versatile and performance-optimized package designed for computing instance-wise segmentation quality metrics from 2D and 3D segmentation maps. panoptica addresses the limitations of existing metrics and provides a modular framework that complements the original intersection-over-union-based panoptic quality with other metrics, such as the distance-based Average Symmetric Surface Distance. The package is open-source, implemented in Python, and accompanied by comprehensive documentation and tutorials. panoptica employs a three-step metrics computation process to cover diverse use cases. The efficacy of panoptica is demonstrated on various real-world biomedical datasets, where an instance-wise evaluation is instrumental for an accurate representation of the underlying clinical task. Overall, we envision panoptica as a valuable tool facilitating in-depth evaluation of segmentation methods.
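For intuition on what instance-wise evaluation involves (independent of the panoptica API, which is not reproduced here), the sketch below greedily matches predicted and reference instances by IoU and reports true/false positives plus the mean IoU of matched pairs. The threshold and the greedy matching strategy are generic choices.

```python
import numpy as np

def match_instances(pred, ref, iou_thr=0.5):
    """Greedy IoU matching between labeled instance maps `pred` and `ref`
    (0 = background). Returns (tp, fp, fn, mean IoU of matched pairs)."""
    pred_ids = [i for i in np.unique(pred) if i != 0]
    ref_ids = [i for i in np.unique(ref) if i != 0]
    matched_ious, used = [], set()
    for r in ref_ids:
        best, best_iou = None, 0.0
        for p in pred_ids:
            if p in used:
                continue
            inter = np.logical_and(pred == p, ref == r).sum()
            union = np.logical_or(pred == p, ref == r).sum()
            iou = inter / union if union else 0.0
            if iou > best_iou:
                best, best_iou = p, iou
        if best is not None and best_iou >= iou_thr:
            matched_ious.append(best_iou)
            used.add(best)
    tp = len(matched_ious)
    fp, fn = len(pred_ids) - tp, len(ref_ids) - tp
    return tp, fp, fn, (float(np.mean(matched_ious)) if matched_ious else 0.0)
```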
Unlocking the Diagnostic Potential of ECG through Knowledge Transfer from Cardiac MRI
Turgut, Özgün, Müller, Philip, Hager, Paul, Shit, Suprosanna, Starck, Sophie, Menten, Martin J., Martens, Eimo, Rueckert, Daniel
The electrocardiogram (ECG) is a widely available diagnostic tool that allows for a cost-effective and fast assessment of cardiovascular health. However, a more detailed examination with expensive cardiac magnetic resonance (CMR) imaging is often preferred for the diagnosis of cardiovascular diseases. While providing detailed visualization of the cardiac anatomy, CMR imaging is not widely available due to long scan times and high costs. To address this issue, we propose the first self-supervised contrastive approach that transfers domain-specific information from CMR images to ECG embeddings. Our approach combines multimodal contrastive learning with masked data modeling to enable holistic cardiac screening solely from ECG data. In extensive experiments using data from 40,044 UK Biobank subjects, we demonstrate the utility and generalizability of our method. We predict the subject-specific risk of various cardiovascular diseases and determine distinct cardiac phenotypes solely from ECG data. In a qualitative analysis, we demonstrate that our learned ECG embeddings incorporate information from CMR image regions of interest. We make our entire pipeline publicly available, including the source code and pre-trained model weights.
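A standard way to realize such cross-modal alignment is an InfoNCE-style objective over paired embeddings; the sketch below pulls ECG and CMR embeddings of the same subject together and pushes mismatched pairs apart. The temperature and the symmetric formulation are common defaults, not necessarily the paper's exact setup.

```python
import torch
import torch.nn.functional as F

def multimodal_contrastive_loss(ecg_emb, cmr_emb, temperature=0.07):
    """Symmetric InfoNCE loss over paired (B, D) embeddings from an ECG
    encoder and a CMR encoder; row i of both tensors belongs to subject i."""
    ecg = F.normalize(ecg_emb, dim=1)
    cmr = F.normalize(cmr_emb, dim=1)
    logits = ecg @ cmr.t() / temperature              # (B, B) similarity matrix
    labels = torch.arange(ecg.shape[0], device=ecg.device)
    return 0.5 * (F.cross_entropy(logits, labels) + F.cross_entropy(logits.t(), labels))
```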
blob loss: instance imbalance aware loss functions for semantic segmentation
Kofler, Florian, Shit, Suprosanna, Ezhov, Ivan, Fidon, Lucas, Horvath, Izabela, Al-Maskari, Rami, Li, Hongwei, Bhatia, Harsharan, Loehr, Timo, Piraud, Marie, Erturk, Ali, Kirschke, Jan, Peeken, Jan C., Vercauteren, Tom, Zimmer, Claus, Wiestler, Benedikt, Menze, Bjoern
Deep convolutional neural networks (CNNs) have proven to be remarkably effective in semantic segmentation tasks. Most popular loss functions were introduced with the goal of improving volumetric scores, such as the Dice coefficient (DSC). By design, DSC can tackle class imbalance; however, it does not recognize instance imbalance within a class. As a result, a large foreground instance can dominate minor instances and still produce a satisfactory DSC. Nevertheless, detecting tiny instances is crucial for many applications, such as disease monitoring. For example, it is imperative to locate and surveil small-scale lesions in the follow-up of multiple sclerosis patients. We propose a novel family of loss functions, \emph{blob loss}, primarily aimed at maximizing instance-level detection metrics, such as F1 score and sensitivity. \emph{Blob loss} is designed for semantic segmentation problems where detecting multiple instances matters. We extensively evaluate a DSC-based \emph{blob loss} in five complex 3D semantic segmentation tasks featuring pronounced instance heterogeneity in terms of texture and morphology. Compared to soft Dice loss, we achieve a 5% improvement for MS lesions, a 3% improvement for liver tumors, and an average 2% improvement for microscopy segmentation tasks in terms of F1 score.
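The core idea, penalizing each foreground instance ("blob") separately so small lesions are not drowned out by large ones, can be sketched as a global soft Dice term plus a per-instance term averaged over the connected components of the ground truth. The masking scheme, the weighting alpha, and the thresholds below are illustrative and differ in detail from the published blob loss.

```python
import torch
from scipy import ndimage

def soft_dice_loss(pred, target, eps=1e-6):
    inter = (pred * target).sum()
    return 1.0 - (2.0 * inter + eps) / (pred.sum() + target.sum() + eps)

def instance_aware_loss(pred, target, alpha=0.5):
    """Global Dice term plus a per-blob term averaged over connected components
    of the binary ground truth, so every instance contributes equally.
    pred, target: (H, W) or (D, H, W) tensors with values in [0, 1]."""
    global_term = soft_dice_loss(pred, target)
    labels, n_blobs = ndimage.label(target.detach().cpu().numpy() > 0.5)
    if n_blobs == 0:
        return global_term
    labels = torch.from_numpy(labels).to(pred.device)
    per_blob = []
    for i in range(1, n_blobs + 1):
        keep = (labels == i) | (target < 0.5)          # this blob plus background only
        per_blob.append(soft_dice_loss(pred[keep], (labels == i).float()[keep]))
    return alpha * global_term + (1.0 - alpha) * torch.stack(per_blob).mean()
```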
FedPIDAvg: A PID controller inspired aggregation method for Federated Learning
Mächler, Leon, Ezhov, Ivan, Shit, Suprosanna, Paetzold, Johannes C.
This paper presents FedPIDAvg, the winning submission to the Federated Tumor Segmentation Challenge 2022 (FETS22). Inspired by FedCostWAvg, our winning contribution to FETS21, we contribute an improved aggregation strategy for federated and collaborative learning. FedCostWAvg is a weighted averaging method that considers not only the number of training samples of each cluster but also the size of the drop in the respective cost function during the last federated round. This can be interpreted as the derivative part of a PID controller (proportional-integral-derivative controller). In FedPIDAvg, we further add the missing integral term. Another key challenge was the vastly varying amount of data per center. We addressed this by modeling the per-center dataset sizes as Poisson-distributed and choosing the training iterations per center accordingly. Our method outperformed all other submissions.
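The Poisson-based handling of unequal center sizes can be illustrated as follows: fit a Poisson rate to the observed dataset sizes and cap the effective contribution of centers that fall far in the tail, so a single oversized center does not dominate a round. The tail rule (mean + 2*sqrt(mean)) and the scaling of iterations are illustrative choices, not the paper's exact procedure.

```python
import numpy as np

def poisson_iteration_budget(sizes, base_iters=100):
    """Choose per-center training iterations from dataset sizes modeled as
    Poisson-distributed; outliers beyond an approximate upper tail are capped.
    `base_iters` and the capping rule are illustrative."""
    sizes = np.asarray(sizes, dtype=float)
    lam = sizes.mean()                        # maximum-likelihood Poisson rate
    cap = lam + 2.0 * np.sqrt(lam)            # ~upper tail of Poisson(lam)
    effective = np.minimum(sizes, cap)
    return (base_iters * effective / effective.mean()).astype(int)
```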