MV-Swin-T: Mammogram Classification with Multi-view Swin Transformer
Sarker, Sushmita, Sarker, Prithul, Bebis, George, Tavakkoli, Alireza
–arXiv.org Artificial Intelligence
Traditional deep learning approaches for breast cancer classification has predominantly concentrated on single-view analysis. In clinical practice, however, radiologists concurrently examine all views within a mammography exam, leveraging the inherent correlations in these views to effectively detect tumors. Acknowledging the significance of multi-view analysis, some studies have introduced methods that independently process mammogram views, either through distinct convolutional branches or simple fusion strategies, inadvertently leading to a loss of crucial inter-view correlations. In this paper, we propose an innovative multi-view network exclusively based on transformers to address challenges in mammographic image classification. Our approach introduces a novel shifted window-based dynamic attention block, facilitating the effective integration of multi-view information and promoting the coherent transfer of this information between views at the spatial feature map level. Furthermore, we conduct a comprehensive comparative analysis of the performance and effectiveness of transformer-based models under diverse settings, employing the CBIS-DDSM and Vin-Dr Mammo datasets. Our code is publicly available at https://github.com/prithuls/MV-Swin-T
arXiv.org Artificial Intelligence
Feb-25-2024
- Country:
- North America > United States
- Nevada > Washoe County > Reno (0.04)
- Europe > France
- Grand Est > Bas-Rhin > Strasbourg (0.04)
- North America > United States
- Genre:
- Research Report (0.82)
- Industry:
- Health & Medicine
- Diagnostic Medicine > Imaging (1.00)
- Therapeutic Area > Oncology
- Breast Cancer (0.59)
- Health & Medicine
- Technology: