Electronic Science and Technology ›› 2023, Vol. 36 ›› Issue (5): 62-70.doi: 10.16180/j.cnki.issn1007-7820.2023.05.010
Previous Articles Next Articles
SHI Jianke,QIAO Meiying,LI Bingfeng,ZHAO Yan
Received:
2021-12-06
Online:
2023-05-15
Published:
2023-05-17
Supported by:
CLC Number:
SHI Jianke,QIAO Meiying,LI Bingfeng,ZHAO Yan. Underwater Occlusion Target Detection Algorithm Based on Attention Mechanism[J].Electronic Science and Technology, 2023, 36(5): 62-70.
Figure 5.
Diagram of three improved triplet attentions (a)Fusion of improved non-local neural network and the left branch of triplet attention (b)Fusion of improved non-local neural network and the middle branch of triplet attention (c)Fusion of improved non-local neural network and the right branch of triplet attention"
Table 6.
Detection results of different similarity functions /%"
相似度函数 | 海胆 | 扇贝 | 海星 | 海参 | mAP |
---|---|---|---|---|---|
Gaussian | 74.72 | 58.10 | 73.58 | 55.55 | 65.49 |
Embedded Gaussian | 75.10 | 57.02 | 74.17 | 56.35 | 65.66 |
Dot Product | 75.42 | 56.24 | 74.03 | 57.25 | 65.74 |
Concatenation | 76.30 | 57.15 | 74.43 | 57.34 | 66.31 |
Proposed | 77.14 | 60.25 | 77.60 | 59.20 | 68.55 |
Table 7.
Detection results of different detection algorithms"
方法 | 主干网络 | 海胆/% | 扇贝/% | 海星/% | 海参/% | mAP/% | 帧率/frames·s-1 |
---|---|---|---|---|---|---|---|
YOLOv3 | DarkNet-53 | 70.42 | 50.04 | 71.55 | 52.70 | 61.18 | 27 |
SSD300 | VGG-16 | 71.72 | 52.30 | 72.54 | 54.72 | 62.82 | 22 |
SSD512 | VGG-16 | 73.20 | 54.81 | 73.10 | 56.37 | 64.37 | 18 |
Faster R-CNN | VGG-16 | 73.35 | 54.03 | 72.52 | 55.32 | 63.81 | 13 |
Faster R-CNN | ResNet-50 | 75.10 | 57.02 | 74.17 | 56.35 | 65.66 | 12 |
Faster R-CNN | proposed | 77.14 | 60.25 | 77.60 | 59.20 | 68.55 | 10 |
Table 8.
Results of different attention mechanisms"
方法 | 海胆/% | 扇贝/% | 海星/% | 海参/% | mAP/% | 帧率/frames·s-1 |
---|---|---|---|---|---|---|
Baseline | 75.10 | 57.02 | 74.17 | 56.35 | 65.66 | 12 |
FRANet | 74.58 | 59.86 | 75.67 | 55.28 | 66.35 | 20 |
ResNet-50+SENet | 73.51 | 60.10 | 76.20 | 58.55 | 67.09 | 10 |
ResNet-50+CBAM | 71.34 | 59.56 | 72.74 | 56.36 | 65.00 | 9 |
ResNet-50+NNNet | 75.85 | 58.26 | 73.35 | 57.17 | 66.16 | 10 |
ResNet-50+TA | 74.24 | 57.52 | 74.94 | 57.55 | 66.06 | 12 |
ResNet-50+NNNet+TA | 76.23 | 58.75 | 76.25 | 58.68 | 67.47 | 7 |
Proposed | 77.14 | 60.25 | 77.60 | 59.20 | 68.55 | 10 |
Figure 9.
Detection results of different algorithms (a)Detection result of YOLOv3 (b)Detection result of FRANet (c)Detection result of SSD (d)ResNet-50 and non-local neural network fusion detection result (e)ResNet-50 and triplet attention fusion detection result (f)ResNet-50 and proposed attention fusion detection result"
Figure 10.
Visualization comparison of feature layer before and after backbone network fusion improved nonlocal neural network (a)Input image (b)Visualization result of the feature layer before using the improved triplet attention fusion (c)Visualization result of the feature layer after using the improved triplet attention fusion"
[1] | 林森, 赵颍. 水下光学图像中目标探测关键技术研究综述[J]. 激光与光电子学进展, 2020, 57(6):26-37. |
Lin Sen, Zhao Ying. Review on key technologies of target exploration in underwater optical images[J]. Laser and Optoelectronic Progress, 2020, 57(6):26-37. | |
[2] | 于红. 水产动物目标探测与追踪技术及应用研究进展[J]. 大连海洋大学学报, 2020, 35(6):793-804. |
Yu Hong. Research progress on object detection and tracking techniques utilization in aquaculture: A review[J]. Journal of Dalian Ocean University, 2020, 35(6):793-804. | |
[3] | 张胜虎, 马惠敏. 遮挡对于目标检测的影响分析[J]. 图学学报, 2020, 41(6):891-896. |
Zhang Shenghu, Ma Huimin. An analysis of occlusion influence on object detection[J]. Journal of Graphics, 2020, 41(6):891-896. | |
[4] | 赵晓飞, 于双和, 李清波, 等. 基于注意力机制的水下目标检测算法[J]. 扬州大学学报(自然科学版), 2021, 24(1):62-67. |
Zhao Xiaofei, Yu Shuanghe, Li Qingbo, et al. Underwater object detection algorithm based on attention mechanism[J]. Journal of Yangzhou University(Natural Science Edition), 2021, 24(1):62-67. | |
[5] |
Wei X Y, Yu L, Tian S W, et al. Underwater target detection with an attention mechanism and improved scale[J]. Multimedia Tools and Applications, 2021, 80(1):33747-33761.
doi: 10.1007/s11042-021-11230-2 |
[6] | Redmon J, Farhadi A. YOLOv3: An incremental improvement[C]. Salt Lake City: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2018:1-12. |
[7] | 邹梓吟, 盖绍彦, 达飞鹏, 等. 基于注意力机制的遮挡行人检测算法[J]. 光学学报, 2021, 41(15):157-165. |
Zou Ziyin, Gai Shaoyan, Da Feipeng, et al. Occluded pedestrian detection algorithm based on attention mechanism[J]. Acta Optica Sinica, 2021, 41(15):157-165. | |
[8] | Woo S, Park J, Lee J Y, et al. CBAM: Convolutional block attention module[C]. Munich: Proceedings of European Conference on Computer Vision, 2018:3-19. |
[9] |
Hu J, Shen L, Albanie S, et al. Squeeze-and-excitation networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, 42(8):2011-2023.
doi: 10.1109/TPAMI.2019.2913372 pmid: 31034408 |
[10] | Wang X L, Xiao T T, Jiang Y N, et al. Repulsion loss: Detecting pedestrians in a crowd[C]. Salt Lake City: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2018:7774-7783. |
[11] | Zhang S F, Wen L Y, Bian X, et al. Occlusion-aware R-CNN: Detecting pedestrians in a crowd[C]. Munich: Proceedings of European Conference on Computer Vision, 2018:637-653. |
[12] | 张莹, 刘子龙, 万伟. 基于Faster R-CNN的无人机车辆目标检测[J]. 电子科技, 2021, 34(11):11-20. |
Zhang Ying, Liu Zilong, Wan Wei. UAV vehicle target detection based on Faster R-CNN[J]. Electronic Science and Technology, 2021, 34(11):11-20. | |
[13] | Lin W H, Zhong J X, Liu S, et al. RoIMix:Proposal-fusion among multiple images for underwater object detection[C]. Barcelona: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, 2020:2588-2592. |
[14] | Landskape D M, Nalamada T, Arasanipalai A U, et al. Rotate to attend: Convolutional triplet attention module[C]. Waikoloa: Proceedings of Conference on IEEE Winter Conference on Applications of Computer Vision, 2021:3139-3148. |
[15] | Wang X L, Girshick R, Gupta A, et al. Non-local neural networks[C]. Salt Lake City: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2018:7794-7803. |
[16] | 魏郭依哲, 陈思遥, 刘玉涛, 等. 水下图像增强和修复算法综述[J]. 计算机应用研究, 2021, 38(9):2561-2569. |
Wei Guoyizhe, Chen Siyao, Liu Yutao, et al. Survey of underwater image enhancement and restoration algorithms[J]. Application Research of Computers, 2021, 38(9):2561-2569. | |
[17] | Ma Y T, Lu T, Wu Y R. Multi-scale relational reasoning with regional attention for visual question answering[C]. Milan: Proceedings of the Twenty-fifth International Conference on Pattern Recognition, 2021:5642-5649. |
[18] | YamashitaT, FurukawaH,Fujiyoshi H. Multiple skip connections of dilated convolution network for semantic segmentation[C]. Athens: Proceedings of the Twenty-fifth IEEE International Conference on Image Processing, 2018:1593-1597. |
[19] |
Ren S Q, He K M, Girshick R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6):1137-1149.
doi: 10.1109/TPAMI.2016.2577031 pmid: 27295650 |
[20] | Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, real-time object detection[C]. Las Vegas: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2016:779-788. |
[21] | Liu W, Anguelov D, Erhan D, et al. SSD: Single shot multibox detector[C]. Amsterdam: Proceedings of European Conference on Computer Vision, 2016:21-37. |
[22] | He K M, Zhang X Y, Ren S Q, et al. Deep residual learning for image recognition[C]. Las Vegas: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2016:770-778. |
[23] | Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[J]. Institute of Electronics, Information and Communication Engineers Transactions on Fundamentals of Electronics, Communications and Computer Sciences, 2014, 14(9):1556-1561. |
[1] | WANG Ziyi, CHEN Shiping. Self-Supervised Network Intrusion Detection Model Based on Graph Contrastive Learning [J]. Electronic Science and Technology, 2025, 38(3): 22-31. |
[2] | ZHAO Yunfei, XUE Cunjin. A Remote Sensing Image Water Extraction Method by Combining Atrous Convolution and Pooling Models [J]. Electronic Science and Technology, 2025, 38(3): 40-46. |
[3] | CHEN Yuyang, LI Feng. Integration of CNN and Transformer for Retinal OCT Image Fluid Segmentation Method [J]. Electronic Science and Technology, 2025, 38(3): 47-59. |
[4] | ZHENG Fangliang, WANG Yannian, LIAN Jihong, RUAN Pei. Face Image Super-Resolution Reconstruction Based on Conditional Priori Swin Transformer [J]. Electronic Science and Technology, 2025, 38(2): 35-41. |
[5] | LAI Ying, JU Zhiyong, YE Yuxin. A Vehicle Detection Algorithm Based on Improved YOLOv4 [J]. Electronic Science and Technology, 2025, 38(1): 81-87. |
[6] | KUAI Xinchen, LI Ye. Hybrid Image Super-Resolution Reconstruction with Multiple and Multi-Scale Attention [J]. Electronic Science and Technology, 2024, 37(9): 34-42. |
[7] | HE Zhiqiang, SUN Zhanquan. Swin-Transformer-Based Carotid Ultrasound Image Plaque Segmentation [J]. Electronic Science and Technology, 2024, 37(9): 48-56. |
[8] | HE Xing, HUANG Yongming, ZHU Yong. Pavement Pothole Detection Method Based on Improved YOLOv5 [J]. Electronic Science and Technology, 2024, 37(7): 53-59. |
[9] | WANG Xinyu, ZHAO Jingwen, LIU Xiang, SHI Yunyu, SHE Yunlang. YOLOv3 Lung Nodule Detection Based on Coordinate Attention [J]. Electronic Science and Technology, 2024, 37(6): 1-7. |
[10] | ZHAO Xu, HU Demin. Multi-Path Parallel Multi-Scale Feature Reuse for Remote Sensing Image Super-Resolution [J]. Electronic Science and Technology, 2024, 37(6): 61-68. |
[11] | YE Yuxin, JU Zhiyong, LAI Ying. Traffic Sign Detection Algorithm Incorporating Receptive Field Enhancement Module and Attention Mechanism [J]. Electronic Science and Technology, 2024, 37(6): 8-16. |
[12] | YUAN Ao, QI Jinpeng, JIA Can, XUE Yuxin, GUO Yangyang. A Classification Method of Time Series Pathological Data Segments Based on Deep Learning [J]. Electronic Science and Technology, 2024, 37(6): 84-91. |
[13] | ZHU Zihao, SONG Yan. Lightweight Capsule Network Fusing Attention and Capsule Pooling [J]. Electronic Science and Technology, 2024, 37(5): 1-8. |
[14] | XIA Rongcheng, LIU Deer. Vehicle Detection and Analysis in Urban Waterlogging Area Based on Deep Learning [J]. Electronic Science and Technology, 2024, 37(5): 18-24. |
[15] | MA Wenjie, ZHANG Xuanxiong. Research on Blind Roads and Obstacle Recognition Algorithm Based on Deep Learning [J]. Electronic Science and Technology, 2024, 37(3): 75-83. |
|