[1] HE Ya-peng, LIU Li-qun. Skip-cycleGAN: A Heterologous Image Registration Model for Orchard Apple[J]. Computer Technology and Development, 2024, 34(07): 40-47. [doi:10.20165/j.cnki.ISSN1673-629X.2024.0104]

Skip-cycleGAN: A Heterologous Image Registration Model for Orchard Apple

Computer Technology and Development [ISSN:1006-6977/CN:61-1281/TN]

Volume:
34
Issue:
No. 07, 2024
Pages:
40-47
Column:
Media Computing
Publication date:
2024-07-10

Article Info

Title:
Skip-cycleGAN: A Heterologous Image Registration Model for Orchard Apple
Article ID:
1673-629X(2024)07-0040-08
Author(s):
HE Ya-peng, LIU Li-qun
School of Information Science and Technology, Gansu Agricultural University, Lanzhou 730070, China
Keywords:
image registration; heterologous images; generative adversarial network; skip connection; ridge regression loss
CLC number:
TP301.6
DOI:
10.20165/j.cnki.ISSN1673-629X.2024.0104
Abstract:
Supervised registration models are limited by the labels they are given, while the cycle-consistent generative adversarial network (CycleGAN) trains unstably, converges slowly, overfits easily, and handles images of complex scenes poorly. To address these problems, an unsupervised heterologous image registration model is proposed that improves CycleGAN in three respects: the generator, the discriminator, and the loss function. Skip connections with a feature-transformation residual layer are introduced between the downsampling and upsampling stages of the generator network. These connections ensure effective gradient flow, reduce information loss during forward and backward propagation, and combine low-level with high-level features, thereby alleviating vanishing and exploding gradients, promoting convergence, and helping the network learn more contextual information. The model is evaluated on one self-built orchard apple dataset and two public datasets. With the improved generator, the experiments indicate that the 70×70 PatchGAN discriminator is more suitable for datasets with relatively large deformation, and the PixelGAN discriminator for datasets with relatively small deformation. In a comparison with eight classical algorithms using six performance metrics, the proposed model achieves better overall performance on the heterologous orchard apple dataset than the comparison algorithms. Future work will improve the model's robustness to the brightness and contrast of heterologous images and make the model lightweight.
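The two discriminators named in the abstract differ mainly in how much of the input image each output score can see. The sketch below computes that receptive field from a list of convolution layers. The layer layouts are an assumption taken from the commonly used pix2pix/CycleGAN reference discriminators (4×4 kernels with strides 2, 2, 2, 1, 1 for the 70×70 PatchGAN, and 1×1 kernels for PixelGAN); the abstract does not state the exact configuration used in this paper, and the names `receptive_field`, `PATCHGAN_70`, and `PIXELGAN` are illustrative.

```python
def receptive_field(layers):
    """Return the receptive field, in input pixels, of one output unit.

    `layers` is a list of (kernel_size, stride) pairs ordered from the
    input layer to the output layer.
    """
    rf = 1
    # Walk from the output back to the input, growing the field layer by layer.
    for kernel, stride in reversed(layers):
        rf = (rf - 1) * stride + kernel
    return rf


# Assumed layouts (common pix2pix/CycleGAN defaults, not confirmed by the paper).
PATCHGAN_70 = [(4, 2), (4, 2), (4, 2), (4, 1), (4, 1)]
PIXELGAN = [(1, 1), (1, 1), (1, 1)]

print(receptive_field(PATCHGAN_70))  # 70
print(receptive_field(PIXELGAN))     # 1
```

Under these assumptions, each PatchGAN score judges a 70×70 patch, so it can respond to local structure, whereas each PixelGAN score judges a single pixel and can only respond to per-pixel intensity or color statistics.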

Last Update: 2024-07-10