Journal of Xidian University, 2022, Vol. 49, Issue (3): 28-35. doi: 10.19665/j.issn1001-2400.2022.03.004
• Information and Communications Engineering •
YANG Zixuan1; XIAO Song1,2; DONG Wenqian1; QU Jiahui1
Received: 2021-02-03
Revised: 2022-02-23
Online: 2022-06-20
Published: 2022-07-04
YANG Zixuan, XIAO Song, DONG Wenqian, QU Jiahui. Thermal target detection method introducing an attention mechanism[J]. Journal of Xidian University, 2022, 49(3): 28-35.
Method | APbbox | AP50 | AP75 | APS | APM | APL | Parameters/M | GFLOPS |
---|---|---|---|---|---|---|---|---|
Cascade-RCNN | 0.375 | 0.756 | 0.310 | 0.274 | 0.442 | 0.471 | 69.17 | 52.64 |
+SENet-1 | 0.364 | 0.737 | 0.296 | 0.262 | 0.431 | 0.445 | 109.04 | 52.69 |
+SENet-8 | 0.376 | 0.755 | 0.308 | 0.275 | 0.441 | 0.464 | 74.17 | 52.66 |
+SENet-16 | 0.371 | 0.744 | 0.304 | 0.273 | 0.434 | 0.441 | 71.68 | 52.66 |
+CBAM | 0.382 | 0.756 | 0.323 | 0.281 | 0.448 | 0.453 | 71.66 | 52.68 |
+GCNet-4 | 0.385 | 0.769 | 0.320 | 0.284 | 0.451 | 0.484 | 79.17 | 52.66 |
+GCNet-16 | 0.380 | 0.758 | 0.320 | 0.280 | 0.446 | 0.457 | 71.69 | 52.66 |
+ECANet | 0.371 | 0.755 | 0.306 | 0.275 | 0.434 | 0.439 | 69.17 | 52.65 |
+TripleNet | 0.381 | 0.759 | 0.322 | 0.280 | 0.448 | 0.480 | 69.17 | 52.73 |
+FDA-light | 0.391 | 0.773 | 0.332 | 0.292 | 0.453 | 0.471 | 69.18 | 52.87 |
+FDA-SAM | 0.387 | 0.779 | 0.323 | 0.284 | 0.453 | 0.510 | 74.20 | 52.75 |
+FDA-FM | 0.384 | 0.768 | 0.327 | 0.283 | 0.449 | 0.465 | 74.49 | 52.75 |
+FDA-SFAM | 0.390 | 0.772 | 0.327 | 0.283 | 0.461 | 0.483 | 74.75 | 52.76 |
+FDA | 0.394 | 0.780 | 0.335 | 0.292 | 0.461 | 0.470 | 74.75 | 52.91 |