Electronic Science and Technology ›› 2024, Vol. 37 ›› Issue (5): 1-8.doi: 10.16180/j.cnki.issn1007-7820.2024.05.001

    Next Articles

Lightweight Capsule Network Fusing Attention and Capsule Pooling

ZHU Zihao, SONG Yan   

  1. School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
  • Received:2022-04-06 Online:2024-05-15 Published:2024-05-21
  • Supported by:
    National Natural Science Foundation of China(62073223);Shanghai Natural Science Foundation(22ZR1443400);Open Project of the National Defense Science and Technology Key Laboratory of Aerospace Flight Dynamics(6142210200304)

Abstract:

In view of the inefficiency of feature information propagation in capsule networks and the huge computational overhead in the routing process, a graph pooling capsule network that combines attention and capsule pooling is proposed. The network mainly has the following two advantages: 1) The capsule attention is proposed, and the attention is applied to the primary capsule layer, which enhances the attention to the important capsules, and improves the accuracy of the prediction of the lower capsules to the higher capsules; 2) A new capsule pooling is proposed. The capsule with the largest weight is screened out at the corresponding positions of all feature maps in the primary capsule layer, and the effective feature information is represented by a small number of important capsules while reducing the number of model parameters. Results on public data sets show that the proposed capsule network achieves the accuracy of 92.60% on CIFAR10 and has excellent robustness against white-box adversarial attacks on complex datasets. In addition, the proposed capsule network achieves 95.74% accuracy on the AffNIST data set with superior affine transformation robustness. The calculation efficiency results show that the amount of floating-point operations of the proposed capsule is reduced by 31.3% and the number of parameters is reduced by 41.9% when compared with traditional CapsNet.

Key words: deep learning, image classification, capsule network, capsule pooling, attention mechanism, robustness, adversarial attack, lightweight

CLC Number: 

  • TP391.4