MSA-TransUNet: A Multi-Scale Attention Enhanced Transformer-UNet Architecture for Accurate Vessel Segmentation and Visualization in Medical Imaging

Authors

  • Yilin Yao International Business School, Henan University, Zhengzhou, Henan, China Author
  • Yinghan Li International Business School, Henan University, Zhengzhou, Henan, China Author
  • Shirong Zheng Purdue University, West Lafayette, IN, USA Author
  • Taoyu Zhu Johns Hopkins University, Baltimore, MD, USA Author

DOI:

https://doi.org/10.71222/hg1xv533

Keywords:

vessel segmentation, medical image analysis, Transformer, attention mechanism, deep learning, TransUNet, multi-scale attention, vascular enhancement

Abstract

Accurate segmentation of vascular structures is critical for computer-aided diagnosis, surgical planning, and quantitative vascular analysis. Nevertheless, vessel segmentation remains a formidable challenge due to low contrast, high noise, intricate topology, and the pervasive presence of thin and tortuous vascular branches. To overcome these obstacles, we propose MSA-TransUNet, a novel hybrid architecture that combines convolutional neural networks (CNNs) with Transformer-based global modeling, augmented by a Multi-Scale Attention Module (MSAM) and a Vessel Enhancement Module (VEM) to enhance vessel representation. The MSAM selectively emphasizes vascular features across multiple receptive fields and strengthens both channel-wise and spatial responses, while the VEM introduces a learnable vascular enhancement mechanism inspired by traditional vesselness filtering techniques. In addition, a connectivity-aware loss function (clDice) is incorporated to preserve delicate vascular topology and minimize branch discontinuities. Experiments conducted on three publicly available vascular datasets, including DRIVE, STARE, and CHASE_DB1, demonstrate that MSA-TransUNet outperforms state-of-the-art segmentation models such as UNet, Attention-UNet, and TransUNet. The proposed approach achieves notable improvements in Dice coefficient, connectivity accuracy, and small-vessel recall. These results indicate that MSA-TransUNet offers a robust and effective solution for medical vascular segmentation and has potential to support clinical vessel visualization and diagnosis.

References

1. R. Azad, M. Asadi-Aghbolaghi, M. Fathy, and S. Escalera, "Bi-directional ConvLSTM U-Net with densley connected convolutions," In Proceedings of the IEEE/CVF international conference on computer vision workshops, 2019, pp. 0-0. doi: 10.1109/iccvw.2019.00052

2. X. F. Du, J. S. Wang, and W. Z. Sun, "UNet retinal blood vessel segmentation algorithm based on improved pyramid pooling method and attention mechanism," Physics in Medicine & Biology, vol. 66, no. 17, p. 175013, 2021.

3. X. Sun, J. Li, J. Ma, H. Xu, B. Chen, Y. Zhang, and T. Feng, "Segmentation of overlapping chromosome images using U-Net with improved dilated convolutions," Journal of Intelligent & Fuzzy Systems, vol. 40, no. 3, pp. 5653-5668, 2021. doi: 10.3233/jifs-201466

4. Y. Jiang, J. Liang, T. Cheng, X. Lin, Y. Zhang, and J. Dong, "MTPA_Unet: Multi-scale transformer-position attention retinal vessel segmentation network joint transformer and CNN," Sensors, vol. 22, no. 12, p. 4592, 2022. doi: 10.3390/s22124592

5. W. Jia, S. Ma, P. Geng, and Y. Sun, "DT-Net: Joint Dual-Input Transformer and CNN for Retinal Vessel Segmentation," Computers, Materials & Continua, vol. 76, no. 3, 2023. doi: 10.32604/cmc.2023.040091

6. Z. Shi, Y. Li, H. Zou, and X. Zhang, "Tcu-net: Transformer embedded in convolutional u-shaped network for retinal vessel segmentation," Sensors, vol. 23, no. 10, p. 4897, 2023. doi: 10.3390/s23104897

7. Y. Zhang, and A. C. Chung, "Retinal vessel segmentation by a transformer-u-net hybrid model with dual-path decoder," IEEE Journal of Biomedical and Health Informatics, vol. 28, no. 9, pp. 5347-5359, 2024.

8. T. H. Pham, X. Li, and K. D. Nguyen, "Seunet-trans: A simple yet effective unet-transformer model for medical image segmentation," IEEE Access, 2024.

9. Y. Wu, S. Qi, M. Wang, S. Zhao, H. Pang, J. Xu, and H. Ren, "Transformer-based 3D U-Net for pulmonary vessel segmentation and artery-vein separation from CT images," Medical & Biological Engineering & Computing, vol. 61, no. 10, pp. 2649-2663, 2023. doi: 10.1007/s11517-023-02872-5

10. S. Shit, J. C. Paetzold, A. Sekuboyina, I. Ezhov, A. Unger, A. Zhylka, and B. H. Menze, "clDice-a novel topology-preserving loss function for tubular structure segmentation," In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2021, pp. 16560-16569.

11. P. Rougé, N. Passat, and O. Merveille, "Cascaded multitask U-Net using topological loss for vessel segmentation and centerline extraction," arXiv preprint arXiv:2307.11603, 2023.a

12. L. Liang, Y. Yang, A. He, X. Dong, and J. Wu, "Cross-Layer Transformer and Multi-Scale Adaptive Fusion of Retinal Vascular Segmentation Algorithm," Journal of Computer-Aided Design & Computer Graphics, vol. 37, no. 3, pp. 495-505, 2025. doi: 10.3724/sp.j.1089.2023-00271

13. Q. Liu, F. Zhou, J. Shen, J. Xu, C. Wan, X. Xu, and J. Yao, "A fundus vessel segmentation method based on double skip connections combined with deep supervision," Frontiers in Cell and Developmental Biology, vol. 12, p. 1477819, 2024. doi: 10.3389/fcell.2024.1477819

Downloads

Published

03 December 2025

Issue

Section

Article

How to Cite

[1]
Y. Yao, Y. Li, S. Zheng, and T. Zhu , Trans., “MSA-TransUNet: A Multi-Scale Attention Enhanced Transformer-UNet Architecture for Accurate Vessel Segmentation and Visualization in Medical Imaging”, J. Med. Life Sci., vol. 1, no. 4, pp. 48–57, Dec. 2025, doi: 10.71222/hg1xv533.