[1]杨萍,陈立伟,王庆凤,等.融合卷积和Transformer的腹部多器官分割网络[J].计算机技术与发展,2024,34(09):47-54.[doi:10.20165/j.cnki.ISSN1673-629X.2024.0161]
 YANG Ping,CHEN Li-wei,WANG Qing-feng,et al.Abdominal Multi Organ Segmentation Network Combining Convolution and Transformer[J].,2024,34(09):47-54.[doi:10.20165/j.cnki.ISSN1673-629X.2024.0161]
点击复制

融合卷积和Transformer的腹部多器官分割网络()

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
34
期数:
2024年09期
页码:
47-54
栏目:
媒体计算
出版日期:
2024-09-10

文章信息/Info

Title:
Abdominal Multi Organ Segmentation Network Combining Convolution and Transformer
文章编号:
1673-629X(2024)09-0047-08
作者:
杨萍1陈立伟1王庆凤1周莹2
1. 西南科技大学 计算机科学与技术学院,四川 绵阳 621000;2. 绵阳市中心医院 放射科,四川 绵阳 621000
Author(s):
YANG Ping1CHEN Li-wei1WANG Qing-feng1ZHOU Ying2
1. School of Computer Science and Technology,Southwest University of Science and Technology,Mianyang 621000,China;2. Radiology Department,Mianyang Center Hospital,Mianyang 621000,China
关键词:
医学图像分割特征融合多尺度空洞卷积Transformer多器官
Keywords:
medical image segmentationfeature fusionmulti scaledilated convolutionsTransformermultiple organs
分类号:
TP391
DOI:
10.20165/j.cnki.ISSN1673-629X.2024.0161
摘要:
腹部多器官分割在计算机辅助诊断中起着至关重要的作用,具有重要的研究价值。 但由于腹部多器官边界模糊、背景复杂以及形状大小多变,使这项任务极具挑战性。 为此,提出了一种融合卷积和 Transformer 的腹部多器官分割网络TCMSUnet。 首先,在特征提取阶段设计了多尺度引导融合模块(GFM),利用高层特征提取的显著语义信息来引导低层特征以增强相邻特征的语义一致性,从而促进不同尺度特征的融合;随后设计了全局局部增强模块(GLE),通过空洞卷积和 Transformer 块结合来增强模型对全局局部上下文信息的提取,使模型在建立长距离依赖关系的同时加强特征的局部关联性;最后,在解码器部分引入多阶段损失聚合结构以加快模型的收敛并优化模型的性能。 在 Synapse 数据集上评估了模型的性能,其平均 Dice 相似系数(DSC)为 81. 20% 。 实验结果表明,所提方法整体性能优于多种比较网络,并对形状大小多变的器官有更好的分割效果。
Abstract:
Abdominal multi organ segmentation plays a crucial role in computer - aided diagnosis and has significant research value.However,due to the blurred boundaries of multiple organs in the abdomen,complex backgrounds,and variable shapes and sizes,this task is extremely challenging. To this end,TCMSUnet,a new abdominal multi organ segmentation network that integrates convolution and Transformer is proposed. Firstly,a multi-scale guided fusion module (GFM) was designed in the feature extraction stage,which utilizes the significant semantic information extracted from high-level features to guide low-level features and enhance the semantic consistency of adjacent features, thereby promoting the fusion of features at different scales. Subsequently, a global local enhancement module (GLE) was designed to enhance the model’s extraction of global and local contextual information through a combination of dilated con-volution and Transformer blocks,enabling the model to establish long-range dependencies while enhancing local correlations of features.Finally,a multi-stage loss aggregation structure was introduced in the decoder section to accelerate the convergence of the model and optimize its performance. The performance of the model was evaluated on the Synapse dataset,with an average Dice similarity coefficient (DSC) of 81. 20% . The experimental results show that the proposed method outperforms multiple comparison networks in overall per-formance and has better segmentation performance for organs with variable shapes and sizes.

相似文献/References:

[1]王艳华 管一弘.基于模糊集理论的医学图像分割的应用[J].计算机技术与发展,2008,(11):223.
 WANG Yan-hua,GUAN Yi-hong.Application of Medical Image Segmentation Technology Based on Fuzzy - Set - Theory[J].,2008,(09):223.
[2]周伟 武港山.基于显著图的花卉图像分类算法研究[J].计算机技术与发展,2011,(11):15.
 ZHOU Wei,WU Gang-shan.Research on Saliency Map Based Flower Image Classification Algorithm[J].,2011,(09):15.
[3]黎粤华,单磊,田仲富,等. 基于多特征融合的视频烟雾检测[J].计算机技术与发展,2016,26(01):129.
 LI Yue-hua,SHAN Lei,TIAN Zhong-fu,et al. Video Smoke Detection Based on Multi Feature Fusion Technology[J].,2016,26(09):129.
[4]刘加运,李玉惠,李勃,等. 一种多维特征融合的车辆对象同一性匹配方法[J].计算机技术与发展,2016,26(04):167.
 LIU Jia-yun,LI Yu-hui,LI Bo,et al. A Vehicle Object Identity Matching Method of Multidimensional Feature Combination[J].,2016,26(09):167.
[5]陈浩翔,蔡建明,刘铿然,等. 手写数字深度特征学习与识别[J].计算机技术与发展,2016,26(07):19.
 CHEN Hao-xiang,CAI Jian-ming,LIU Keng-ran,et al. Deep Learning and Recognition of Handwritten Numeral Features[J].,2016,26(09):19.
[6]张雅倩,曾卫明,石玉虎.基于特征融合与稀疏表示的人耳识别[J].计算机技术与发展,2017,27(12):7.
 ZHANG Ya-qian,ZENG Wei-min,SHI Yu-hu.Ear Recognition Based on Feature Fusion and Sparse Representation[J].,2017,27(09):7.
[7]谭程午,夏利民,王 嘉.基于融合特征的群体行为识别[J].计算机技术与发展,2018,28(01):17.[doi:10.3969/ j. issn.1673-629X.2018.01.004]
 TAN Cheng-wu,XIA Li-min,WANG Jia.Recognition of Human Group Action Based on Fusion Features[J].,2018,28(09):17.[doi:10.3969/ j. issn.1673-629X.2018.01.004]
[8]王 敏,陈立潮,曹建芳,等.Hadoop 下自适应随机权值多特征融合图像分类[J].计算机技术与发展,2018,28(11):30.[doi:10.3969/ j. issn.1673-629X.2018.11.007]
 WANG Min,CHEN Li-chao,CAO Jian-fang,et al.Multi-feature Fusion Image Classification of Adaptive Random Weight Based on Hadoop[J].,2018,28(09):30.[doi:10.3969/ j. issn.1673-629X.2018.11.007]
[9]韩欣欣,叶奇玲.基于 SIFT 和 HOG 特征融合的人体行为识别方法[J].计算机技术与发展,2019,29(06):71.[doi:10. 3969 / j. issn. 1673-629X. 2019. 06. 015]
 HAN Xin-xin,YE Qi-ling.Human Action Recognition Based on Feature Fusion of SIFT and HOG[J].,2019,29(09):71.[doi:10. 3969 / j. issn. 1673-629X. 2019. 06. 015]
[10]宋相法,吕 明.融合三维骨架和深度图像特征的人体行为识别[J].计算机技术与发展,2019,29(07):55.[doi:10. 3969 / j. issn. 1673-629X. 2019. 07. 011]
 SONG Xiang-fa,LYU Ming.Human Activity Recognition Based on Fusing 3D Skeleton and Depth Image Feature[J].,2019,29(09):55.[doi:10. 3969 / j. issn. 1673-629X. 2019. 07. 011]

更新日期/Last Update: 2024-09-10