Narrowing the semantic gaps in U-Net with learnable skip connections: The case of medical image segmentation

Wang, Haonan, Cao, Peng, Liu, Xiaoli, Yang, Jinzhu, Zaiane, Osmar

Dec-23-2023–arXiv.org Artificial Intelligence

Most state-of-the-art methods for medical image segmentation adopt the encoder-decoder architecture. However, this U-shaped framework still has limitations in capturing the non-local multi-scale information with a simple skip connection. To solve the problem, we firstly explore the potential weakness of skip connections in U-Net on multiple segmentation tasks, and find that i) not all skip connections are useful, each skip connection has different contribution; ii) the optimal combinations of skip connections are different, relying on the specific datasets. Based on our findings, we propose a new segmentation framework, named UDTransNet, to solve three semantic gaps in U-Net. Specifically, we propose a Dual Attention Transformer (DAT) module for capturing the channel- and spatial-wise relationships to better fuse the encoder features, and a Decoder-guided Recalibration Attention (DRA) module for effectively connecting the DAT tokens and the decoder features to eliminate the inconsistency. Hence, both modules establish a learnable connection to solve the semantic gaps between the encoder and the decoder, which leads to a high-performance segmentation model for medical images. Comprehensive experimental results indicate that our UDTransNet produces higher evaluation scores and finer segmentation results with relatively fewer parameters over the state-of-the-art segmentation methods on different public datasets. Code: https://github.com/McGregorWwww/UDTransNet.

artificial intelligence, machine learning, segmentation, (18 more...)

arXiv.org Artificial Intelligence

Dec-23-2023

arXiv.org PDF

Add feedback

Country:
- Asia > China (0.46)
- North America > Canada
  - Alberta (0.28)

Genre:
- Research Report > New Finding (0.34)

Industry:
- Health & Medicine
  - Diagnostic Medicine > Imaging (1.00)
  - Therapeutic Area (1.00)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning
    - Neural Networks > Deep Learning (0.47)
  - Sensing and Signal Processing > Image Processing (1.00)