电子科技 ›› 2025, Vol. 38 ›› Issue (8): 27-32.doi: 10.16180/j.cnki.issn1007-7820.2025.08.004

• • 上一篇    下一篇

基于Agglomerator特征提取的非迭代路由胶囊网络

倪庭轩, 宋燕()   

  1. 上海理工大学 光电信息与计算机工程学院,上海 200093
  • 收稿日期:2024-01-15 修回日期:2024-02-08 出版日期:2025-08-15 发布日期:2025-07-10
  • 通讯作者: 宋燕(1979-),女,E-mail:sonya@usst.edu.cn,博士,教授。研究方向:模式识别、数据分析和预测控制等。
  • 作者简介:倪庭轩(1997-),男,硕士研究生。研究方向:图像处理。
  • 基金资助:
    国家自然科学基金(62073223);上海市自然科学基金(22ZR1443400)

A Non-Iterative Routing CapsNet Based on Agglomerator Feature Extraction

NI Tingxuan, SONG Yan()   

  1. School of Optical-Electrical and Computer Engineering,University of Shanghai for Science and Technology,Shanghai 200093,China
  • Received:2024-01-15 Revised:2024-02-08 Online:2025-08-15 Published:2025-07-10
  • Supported by:
    National Natural Science Foundation of China(62073223);Shanghai Natural Science Foundation(22ZR1443400)

摘要:

针对胶囊网络特征提取器可解释性低的问题,文中提出了一种融合DenseCap(Dense Capsule)和Agglomerator的新特征提取器。将密集连接的低级高级特征与Agglomerator的局部整体特征相结合,使相邻两层特征对应局部和整体,提高了可解释性。DenseCap和Agglomerator的并联连接方式使模型结构更紧凑,能够减少可训练参数。将绝对位置编码与Agglomerator密集连接,在计算相对注意力时保留绝对值编码和相对位置编码的优点,保持平移等变性。实验结果表明,相较于胶囊网络以及原始Agglomerator,Agg-CapsNet(Agglomerator Capsule Network)在CIFAR10、MNIST、Fashion-MNIST和SmallNorb方面精度更好。在位置编码的平移实验中,通过可视化证明Agg-CapsNet具有平移等变性。

关键词: 特征提取, 胶囊网络, Agglomerator, 位置编码, 等变性, 注意力, 局部整体特征, 对比学习

Abstract:

In view of the problem of low interpretability of the feature extractor of capsule network, a new feature extractor combining DenseCap(Dense Capsule) and Agglomerator is proposed in this study. By combining the densely connected low-level and high-level features with the local global features of Agglomerator, the adjacent two layers of features correspond to the local and the whole, which improves the interpretability. The parallel connection of DenseCap and Agglomerator makes the model structure more compact and reduces trainable parameters. Dense connection of absolute position coding with Agglomerator preserves the advantages of absolute value coding and relative position coding when calculating relative attention, and maintains translational isotropy. The experimental results show that compared with the Capsule Network and the original Agglomerator, the Agg-CapsNet (Agglomerator Capsule Network) has better accuracy in terms of CIFAR10, MNIST, Fashion MNIST and SmallNorb. In translation experiments of position coding, Agg-CapsNet is proved to have translational isotropy by visualization.

Key words: feature extraction, capsule network, Agglomerator, position coding, invariance, attention, part and whole feature, contrastive learning

中图分类号: 

  • TP183