Electronic Science and Technology ›› 2024, Vol. 37 ›› Issue (4): 62-68. doi: 10.16180/j.cnki.issn1007-7820.2024.04.009

Instance Segmentation Based on Attention and Image Contour

GU Denghua, GU Chunhua

  1. School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
  • Received: 2022-11-17 Online: 2024-04-15 Published: 2024-04-19
  • About the authors: GU Denghua (b. 1997), male, master's student. Research interest: instance segmentation.
    GU Chunhua (b. 1970), male, Ph.D., professor. Research interests: image processing, artificial intelligence, cloud computing.
  • Supported by:
    Science and Technology Action Plan of Shanghai Municipal Science and Technology Commission (20DZ2308700)

Abstract:

Contour-based instance segmentation methods represent each object with a small number of contour vertices, which reduces the number of parameters and improves runtime efficiency. However, their accuracy is lower than that of traditional per-pixel segmentation algorithms, and the resulting segmentations are of poorer quality. To improve accuracy, this paper proposes an instance segmentation model that combines image contours with an attention mechanism (Attend the Contour snake, AC-snake). An improved large convolution kernel (Largekernel+) is added to the backbone network to enlarge the model's receptive field and extract richer feature information. The network structure of the contour vertex deformation stage is also improved: a Dual Channel attention (DC-attention) module is incorporated to strengthen the useful information of the contour vertices, reduce ineffective parameters in the trained network, and improve detection accuracy and training speed. Experimental results on the Cityscapes validation set show that the proposed model performs better than the original model.
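The link between kernel size and receptive field mentioned in the abstract can be made concrete with the standard receptive-field recurrence for a stack of convolution layers. The sketch below is only an illustration of that general relationship: the function name `receptive_field`, the kernel sizes, and the layer counts are hypothetical and are not taken from the paper, which does not state the configuration of Largekernel+ in this abstract.

```python
def receptive_field(layers):
    """Return the receptive field (in input pixels, along one axis) of a
    stack of convolution layers.

    layers: list of (kernel_size, stride) tuples, applied in order.
    Dilation is ignored in this simplified sketch.
    """
    rf = 1    # a single output unit initially "sees" one input pixel
    jump = 1  # spacing, in input pixels, between adjacent units of the current layer
    for kernel_size, stride in layers:
        rf += (kernel_size - 1) * jump
        jump *= stride
    return rf


# Hypothetical configurations, chosen only for illustration.
small_kernels = [(3, 1)] * 3   # three stacked 3x3 convolutions, stride 1
large_kernels = [(7, 1)] * 3   # three stacked 7x7 convolutions, stride 1

print(receptive_field(small_kernels))  # 7
print(receptive_field(large_kernels))  # 19
```

With the same depth, the larger kernels cover a wider input region per output unit, which is the effect the abstract attributes to adding a large convolution kernel to the backbone.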

Key words: instance segmentation, image contour, contour vertex, per-pixel, attention mechanism, large kernel, receptive field, feature information

CLC Number:

  • TN247