radimagenet
Improving Predictive Confidence in Medical Imaging via Online Label Smoothing
Choudhury, Kushan, Roy, Shubhrodeep, Chanda, Ankur, Biswas, Shubhajit, Kuiry, Somenath
Deep learning models, especially convolutional neural networks, have achieved impressive results in medical image classification. However, these models often produce overconfident predictions, which can undermine their reliability in critical healthcare settings. While traditional label smoothing offers a simple way to reduce such overconfidence, it fails to consider relationships between classes by treating all non-target classes equally. In this study, we explore the use of Online Label Smoothing (OLS), a dynamic approach that adjusts soft labels throughout training based on the model's own prediction patterns. We evaluate OLS on the large-scale RadImageNet dataset using three widely used architectures: ResNet-50, MobileNetV2, and VGG-19. Our results show that OLS consistently improves both Top-1 and Top-5 classification accuracy compared to standard training methods, including hard labels, conventional label smoothing, and teacher-free knowledge distillation. In addition to accuracy gains, OLS leads to more compact and well-separated feature embeddings, indicating improved representation learning. These findings suggest that OLS not only strengthens predictive performance but also enhances calibration, making it a practical and effective solution for developing trustworthy AI systems in the medical imaging domain.
Towards Optimal Convolutional Transfer Learning Architectures for Breast Lesion Classification and ACL Tear Detection
Frees, Daniel, Bolling, Moritz, Bhagirath, Aditri
Modern computer vision models have proven to be highly useful for medical imaging classification and segmentation tasks, but the scarcity of medical imaging data often limits the efficacy of models trained from scratch. Transfer learning has emerged as a pivotal solution to this, enabling the fine-tuning of high-performance models on small data. Mei et al. (2022) found that pre-training CNNs on a large dataset of radiologist-labeled images (RadImageNet) enhanced model performance on downstream tasks compared to ImageNet pretraining. The present work extends Mei et al. (2022) by conducting a comprehensive investigation to determine optimal CNN architectures for breast lesion malignancy detection and ACL tear detection, as well as performing statistical analysis to compare the effect of RadImageNet and ImageNet pre-training on downstream model performance. Our findings suggest that 1-dimensional convolutional classifiers with skip connections, ResNet50 pre-trained backbones, and partial backbone unfreezing yields optimal downstream medical classification performance. Our best models achieve AUCs of 0.9969 for ACL tear detection and 0.9641 for breast nodule malignancy detection, competitive with the results reported by Mei et al. (2022) and surpassing other previous works. We do not find evidence confirming RadImageNet pre-training to provide superior downstream performance for ACL tear and breast lesion classification tasks.
Source Matters: Source Dataset Impact on Model Robustness in Medical Imaging
Juodelyte, Dovile, Lu, Yucheng, Jiménez-Sánchez, Amelia, Bottazzi, Sabrina, Ferrante, Enzo, Cheplygina, Veronika
Transfer learning has become an essential part of medical imaging classification algorithms, often leveraging ImageNet weights. However, the domain shift from natural to medical images has prompted alternatives such as RadImageNet, often demonstrating comparable classification performance. However, it remains unclear whether the performance gains from transfer learning stem from improved generalization or shortcut learning. To address this, we investigate potential confounders -- whether synthetic or sampled from the data -- across two publicly available chest X-ray and CT datasets. We show that ImageNet and RadImageNet achieve comparable classification performance, yet ImageNet is much more prone to overfitting to confounders. We recommend that researchers using ImageNet-pretrained models reexamine their model robustness by conducting similar experiments. Our code and experiments are available at https://github.com/DovileDo/source-matters.
Revisiting Hidden Representations in Transfer Learning for Medical Imaging
Juodelyte, Dovile, Jiménez-Sánchez, Amelia, Cheplygina, Veronika
While a key component to the success of deep learning is the availability of massive amounts of training data, medical image datasets are often limited in diversity and size. Transfer learning has the potential to bridge the gap between related yet different domains. For medical applications, however, it remains unclear whether it is more beneficial to pre-train on natural or medical images. We aim to shed light on this problem by comparing initialization on ImageNet and RadImageNet on seven medical classification tasks. Our work includes a replication study, which yields results contrary to previously published findings. In our experiments, ResNet50 models pre-trained on ImageNet tend to outperform those trained on RadImageNet. To gain further insights, we investigate the learned representations using Canonical Correlation Analysis (CCA) and compare the predictions of the different models. Our results indicate that, contrary to intuition, ImageNet and RadImageNet may converge to distinct intermediate representations, which appear to diverge further during fine-tuning. Despite these distinct representations, the predictions of the models remain similar. Our findings show that the similarity between networks before and after fine-tuning does not correlate with performance gains, suggesting that the advantages of transfer learning might not solely originate from the reuse of features in the early layers of a convolutional neural network.
Moving from ImageNet to RadImageNet for Improved Transfer Learning and Generalizability
See also the article by Mei et al in this issue. Alexandre Cadrin-Chênevert, MD, BEng, is a diagnostic and interventional radiologist at CISSS Lanaudière and clinical professor at Laval University. He has previously served as chief of the medical imaging department. As a Kaggle competition master, he has successfully participated in many machine learning competitions. He is an early member of the Canadian Association of Radiologists (CAR) Artificial Intelligence (AI) Standing Committee. His current research interests include deep learning, computer vision, object detection, self-supervised learning, model generalizability, and public medical imaging datasets.
RadImageNet: Training AI Models With Radiologic vs. Photographic Images
Yang Yang, PhD, Zahi Fayad, PhD, Xueyan Mei, PhD, Timothy Deyer, PhD and colleagues from Icahn School of Medicine at Mount Sinai, University of Oklahoma, and Weill Cornell Medicine conducted a study to evaluate the performance of AI models pretrained on radiologic images compared to photographic images. They created a large-scale, diverse medical imaging dataset to generate CNNs trained only from radiologic images. This is a significant study because the researchers demonstrated that pretraining with radiologic images rather than photographic images may result in more effective transfer learning for radiology AI models. A paper detailing the study entitled RadImageNet: An Open Radiologic Deep Learning Research Dataset for Effective Transfer Learning was published in RSNA Radiology AI on July 27, 2022. Within 10 days of publication, the paper has been downloaded over 1,000 times.