西安电子科技大学学报 ›› 2025, Vol. 52 ›› Issue (3): 217-231.doi: 10.19665/j.issn1001-2400.20250503
• 第二十七届中国科协年会——AI时代网络技术创新 • 上一篇 下一篇
顿皓(
), 田春娜(
), 李向阳(
), 单笑(
), 郭宇杰(
)
收稿日期:2024-11-01
出版日期:2025-06-20
发布日期:2025-05-21
通讯作者:
田春娜(1980—),女,教授,E-mail:chnatian@xidian.edu.cn作者简介:顿 皓(2002—),男,西安电子科技大学硕士研究生,E-mail:hdun@stu.xidian.edu.cn基金资助:
DUN Hao(
), TIAN Chunna(
), LI Xiangyang(
), SHAN Xiao(
), GUO Yujie(
)
Received:2024-11-01
Online:2025-06-20
Published:2025-05-21
摘要:
现有基于深度学习的SAR目标分类算法性能依赖于大量数据,但受限于高采集成本和目标非合作性等影响,获取的数据量有限,从而导致算法性能显著下降,限制了雷达图像分类的广泛应用。目前基于生成对抗网络的SAR图像生成方法存在生成质量低、易发生模式坍塌和数据影响因子不可控等问题。为此,提出了一种SAR图像快速扩散生成方法,以提高基于深度学习方法的雷达目标识别准确率。通过在扩散模型中嵌入基于互注意力的条件机制,将SAR图像目标的类别和方位角分别进行编码,从而引导扩散模型生成指定类别和方位角的SAR图像。为解决扩散模型生成速度慢的问题,采用快速求解器DPM-Solver加速采样过程,并探究不同超参数对生成质量的影响。在MSTAR数据集上的实验表明,在数据量为60%时,扩散模型生成的增广样本可以将目标分类模型对SAR目标的分类准确率从95.16%提升至97.62%,并且使用DPM-Solver加速生成后的增广样本能够进一步将分类准确率提高到97.98%。
中图分类号:
顿皓, 田春娜, 李向阳, 单笑, 郭宇杰. SAR图像快速扩散生成方法[J]. 西安电子科技大学学报, 2025, 52(3): 217-231.
DUN Hao, TIAN Chunna, LI Xiangyang, SHAN Xiao, GUO Yujie. Accelerated diffusion-based method for SAR image generation[J]. Journal of Xidian University, 2025, 52(3): 217-231.
表3
MSTAR数据集中使用DPM-Solver不同超参数生成图像的距离得分"
| 步数 | 二阶 | 三阶 | ||||
|---|---|---|---|---|---|---|
| 均匀 | 二次型 | 对数信噪比 | 均匀 | 二次型 | 对数信噪比 | |
| 10 | 22.13 | 8.91 | 5.00 | 61.49 | 15.17 | 14.15 |
| 15 | 21.51 | 8.29 | 5.78 | 67.75 | 8.32 | 7.35 |
| 20 | 21.07 | 7.45 | 6.06 | 48.98 | 6.44 | 6.55 |
| 25 | 19.23 | 6.90 | 6.03 | 32.49 | 6.12 | 6.18 |
| 50 | 11.31 | 6.22 | 6.05 | 10.72 | 6.03 | 6.07 |
| 100 | 7.43 | 6.09 | 6.00 | 5.81 | 6.03 | 6.03 |
| 250 | 6.21 | 6.04 | 6.00 | 5.99 | 6.02 | 6.02 |
表5
生成图像质量评价结果"
| 数据集 | 均值 | 方差 | 等效视数 | 辐射分辨率 | IS | 距离得分 | 生成时间/(s·张-1) |
|---|---|---|---|---|---|---|---|
| MSTAR数据集 | 28.17 | 1 134.27 | 0.758 | 3.364 | 1.187 | ||
| DDPM | 31.89 | 1 637.14 | 0.673 | 3.493 | 1.119 | 37.65 | 6.355 |
| DiT-S/2 | 33.58 | 1 823.60 | 0.682 | 3.494 | 1.280 | 8.83 | 11.622 |
| 文中方法 | 30.56 | 1 171.69 | 0.859 | 3.219 | 1.177 | 3.91 | 0.041 |
| SAMPLE数据集 | 49.57 | 838.55 | 2.99 | 2.01 | 1.305 | ||
| DDPM | 48.29 | 809.60 | 2.93 | 2.02 | 1.338 | 6.48 | 6.381 |
| DiT-S/2 | 52.29 | 994.41 | 2.84 | 2.08 | 1.684 | 37.63 | 11.602 |
| 文中方法 | 50.39 | 840.20 | 3.08 | 1.99 | 1.344 | 5.83 | 0.041 |
表9
使用互注意力机制前后的生成图像质量评价结果及生成时间"
| 数据集 | 均值 | 方差 | 等效视数 | 辐射分辨率 | IS | 距离得分 | 生成时间/(s·张-1) |
|---|---|---|---|---|---|---|---|
| MSTAR数据集 | 28.17 | 1 134.27 | 0.758 | 3.364 | 1.187 | ||
| 文中方法(无互注意力) | 29.60 | 1 463.22 | 0.672 | 3.520 | 1.142 | 10.63 | 0.022 |
| 文中方法 | 30.56 | 1 171.69 | 0.859 | 3.219 | 1.177 | 3.91 | 0.041 |
| SAMPLE数据集 | 49.57 | 838.55 | 2.99 | 2.01 | 1.305 | ||
| 文中方法(无互注意力) | 48.36 | 840.32 | 2.85 | 2.05 | 1.277 | 8.44 | 0.023 |
| 文中方法 | 50.39 | 840.20 | 3.08 | 1.99 | 1.344 | 5.83 | 0.041 |
| [1] | 谢雯, 滑文强, 焦李成, 等. 采用深度学习的极化SAR地物分类方法综述[J]. 西安电子科技大学学报, 2023, 50(3):151-170. |
| XIE Wen, HUA Wenqiang, JIAO Licheng, et al. Review on Polarimetric SAR Terrain Classification Methods Using Deep Learning[J]. Journal of Xidian University, 2023, 50(3):151-170. | |
| [2] | 赵鹏菲, 黄丽佳. 一种基于EfficientNet与BiGRU的多角度SAR图像目标识别方法[J]. 雷达学报, 2021, 10(6):895-904. |
| ZHAO Pengfei, HUANG Lijia. Target Recognition Method for Multi-Aspect Synthetic Aperture Radar Images Based on EfficientNet and BiGRU[J]. Journal of Radars, 2021, 10(6):895-904. | |
| [3] | MA X, JI K, FENG S, et al. Open Set Recognition with Incremental Learning for SAR Target Classification[J]. IEEE Transactions on Geoscience and Remote Sensing, 2023, 61(1):1-14. |
| [4] | GENG Z, XU Y, WANG B N, et al. Target Recognition in SAR Images by Deep Learning with Training Data Augmentation[J]. Sensors, 2023, 23(2):941. |
| [5] | KANG M, ZHU J Y, ZHANG R, et al. Scaling up GANs for Text-to-Image Synthesis[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2023:10124-10134. |
| [6] | MA X, FANG G, WANG X. DeepCache:Accelerating Diffusion Models for Free[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2024:15762-15772. |
| [7] | WU T, FAN Z, LIU X, et al. AR-Diffusion:Auto-Regressive Diffusion Model for Text Generation[C]// Proceedings of the Advances in Neural Information Processing Systems 36:Annual Conference on Neural Information Processing Systems 2023. San Diego: NIPS,2023:39957-39974. |
| [8] | DENG J, LU J, ZHANG T. Diff3DETR:Agent-Based Diffusion Model for Semi-Supervised 3D Object Detection[C]// European Conference on Computer Vision. Berlin:Springer,2024:57-73. |
| [9] | ESSER P, KULAL S, BLATTMANN A, et al. Scaling Rectified Flow Transformers for High-Resolution Image Synthesis[C]// ICML'24:Proceedings of the 41th International Conference on Machine Learning. San Diego: PMLR,2024:12606-12633. |
| [10] | KINGMA D P, WELLING M. Auto-Encoding Variational Bayes[C]// 2nd International Conference on Learning Representations. Banff:ICLR,2013:1-14. |
| [11] | GOODFELLOW I, POUGET-ABADIE J, MIRZA M, et al. Generative Adversarial Networks[J]. Communications of the ACM, 2020, 63(11):139-144. |
| [12] | MIRZA M, OSINDERO S. Conditional Generative Adversarial Nets (2014)[J/OL]. [2014-11-06]. https://arxiv.org/abs/1411.1784. |
| [13] | ODENA A, OLAH C, SHLENS J. Conditional Image Synthesis with Auxiliary Classifier GANs[C]// ICML'17:Proceedings of the 34th International Conference on Machine Learning. San Diego: PMLR,2017:2642-2651. |
| [14] | CHEN X, DUAN Y, HOUTHOOFT R, et al. InfoGAN:Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets[C]// Proceedings of the Advances in Neural Information Processing Systems 29:Annual Conference on Neural Information Processing Systems 2016. San Diego: NIPS,2016:2180-2188. |
| [15] | KARRAS T, LAINE S, AILA T. A Style-Based Generator Architecture for Generative Adversarial Networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 43(12):4217-4228. |
| [16] | SAUER A, KARRAS T, LAINE S, et al. StyleGAN-T:Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis[C]// ICML'23:Proceedings of the 40th International Conference on Machine Learning. San Diego: PMLR,2023:30105-30118. |
| [17] | GUO J, LEI B, DING C, et al. Synthetic Aperture Radar Image Synthesis by Using Generative Adversarial Nets[J]. IEEE Geoscience and Remote Sensing Letters, 2017, 14(7):1111-1115. |
| [18] | ZHAI J, DANG X, CHEN F, et al. SAR Image Generation Using Structural Bayesian Deep Generative Adversarial Network[C]// 2019 Photonics & Electromagnetics Research Symposium -Fall. Xiamen:PIERS,2019:1386-1392. |
| [19] | 李诗怡, 付光远, 崔忠马, 等. 基于多尺度生成对抗网络的SAR图像样本增广[J]. 激光与光电子学进展, 2020, 57(20):175-185. |
| LI Shiyi, FU Guangyuan, CUI Zhongma, et al. SAR Aircraft Data Sets Augmentation Based on Multi-Scale Generative Adversarial Network[J]. Laser & Optoelectronics Progress, 2020, 57(20):175-185. | |
| [20] | OH J, KIM M. PeaceGAN:A GAN-Based Multi-Task Learning Method for SAR Target Image Generation with a Pose Estimator and an Auxiliary Classifier[J]. Remote Sensing, 2021, 13(19):3939. |
| [21] | FENG Z, DAKOVIC M, JI H, et al. Interpretation of Latent Codes in InfoGAN with SAR Images[J]. Remote Sensing, 2023, 15(5):1254. |
| [22] | HO J, JAIN A, ABBEEL P. Denoising Diffusion Probabilistic Models[C]// Proceedings of the Advances in Neural Information Processing Systems 33:Annual Conference on Neural Information Processing Systems 2020. San Diego: NIPS,2020:6840-6851. |
| [23] | DHARIWAL P, NICHOL A. Diffusion Models Beat GANs on Image Synthesis[C]// Proceedings of the Advances in Neural Information Processing Systems 34:Annual Conference on Neural Information Processing Systems 2021. San Diego: NIPS,2021:8780-8794. |
| [24] | SAHARIA C, CHAN W, SAXENA S, et al. Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding[C]// Proceedings of the Advances in Neural Information Processing Systems 35:Annual Conference on Neural Information Processing Systems 2022. San Diego: NIPS,2022:36479-36494. |
| [25] | PEEBLES W, XIE S. Scalable Diffusion Models with Transformers[C]// Proceedings of the IEEE/CVF International Conference on Computer Vision. Piscataway:IEEE,2023:4195-4205. |
| [26] | CHEN J, YU J, GE C, et al. PixArt-α:Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis[C]// 12th International Conference on Learning Representations. Vienna:ICLR,2024:1-30. |
| [27] | ZHENG W, TENG J, YANG Z, et al. CogView3:Finer and Faster Text-to-Image Generation via Relay Diffusion[C]// European Conference on Computer Vision. Berlin:Springer,2024:1-22. |
| [28] | LU C, ZHOU Y, BAO F, et al. DPM-Solver:A Fast Ode Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps[C]// Proceedings of the Advances in Neural Information Processing Systems 35:Annual Conference on Neural Information Processing Systems 2022. San Diego: NIPS,2022:5775-5787. |
| [29] | MILDENHALL B, SRINIVASAN P P, TANCIK M, et al. NeRF:Representing Scenes as Neural Radiance Fields for View Synthesis[J]. Communications of the ACM, 2021, 65(1):99-106. |
| [30] | SONG Y, SOHL-DICKSTEIN J, KINGMA D P, et al. Score-Based Generative Modeling through Stochastic Differential Equations[C]// 9th International Conference on Learning Representations. Virtual:ICLR,2021:1-36. |
| [31] | HOCHBRUCK M, OSTERMANN A. Explicit Exponential Runge-Kutta Methods for Semilinear Parabolic Problems[J]. SIAM Journal on Numerical Analysis, 2005, 43(3):1069-1090. |
| [32] | LUAN V T. Efficient Exponential Runge-Kutta Methods of High Order:Construction and Implementation[J]. BIT Numerical Mathematics, 2021, 61(2):535-560. |
| [33] | KEYDEL E R, LEE S W, MOORE J T. MSTAR Extended Operating Conditions:A Tutorial[J]. Algorithms for Synthetic Aperture Radar Imagery Ⅲ, 1996,2757:228-242. |
| [34] | LEWIS B, SCARNATI T, SUDKAMP E, et al. A SAR Dataset for ATR Development:the Synthetic and Measured Paired Labeled Experiment (SAMPLE)[C]// SPIE Defense+Commercial Sensing.Algorithms for Synthetic Aperture Radar Imagery ⅩⅩⅥ: 10987. Baltimore:SPIE,2019:39-54. |
| [35] | 张倩. SAR图像质量评估及其目标识别应用[D]. 合肥: 中国科学技术大学, 2011. |
| [36] | HEUSEL M, RAMSAUER H, UNTERTHINER T, et al. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium[C]// Proceedings of the Advances in Neural Information Processing Systems 30:Annual Conference on Neural Information Processing Systems 2017. Piscataway:IEEE,2017:6629-6640. |
| [37] | SZEGEDY C, VANHOUCKE V, IOFFE S, et al. Rethinking the Inception Architecture for Computer Vision[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2016:2818-2826. |
| [38] | CHEN X, LIANG C, HUANG D, et al. Symbolic Discovery of Optimization Algorithms[C]// Proceedings of the Advances in Neural Information Processing Systems 36:Annual Conference on Neural Information Processing Systems 2023. San Diego: NIPS,2023:49205-49233. |
| [1] | 郑洋, 王榕旭, 郭开泰, 梁继民. ARWCGAN:一种高质量的多类别SAR图像生成方法[J]. 西安电子科技大学学报, 2025, 52(2): 101-112. |
| [2] | 龚峻扬, 付卫红, 方厚章. SAR图像舰船目标检测的轻量化和特征增强研究[J]. 西安电子科技大学学报, 2024, 51(2): 96-106. |
| [3] | 谢雯,滑文强,焦李成,王若男. 采用深度学习的极化SAR地物分类方法综述[J]. 西安电子科技大学学报, 2023, 50(3): 151-170. |
| [4] | 遆晶晶, 索志勇, 王婷婷, 赵秉吉, 张乐茹. 导航卫星星地双基SAR时频同步误差估计方法[J]. 西安电子科技大学学报, 2023, 50(2): 42-53. |
| [5] | 张强, 杨欣朋, 赵世祥, 卫栋栋, 韩臻. 注意力机制的SAR图像车辆目标检测网络[J]. 西安电子科技大学学报, 2023, 50(1): 36-47. |
| [6] | 杨静波,赵启军,吕泽均. 维度情感模型下的表情图像生成及应用[J]. 西安电子科技大学学报, 2021, 48(5): 30-37. |
| [7] | 王昕,常贵清,池琛琛. 广角SAR各向异性散射目标分离成像[J]. 西安电子科技大学学报, 2021, 48(2): 109-116. |
| [8] | 陆金文,闫华,殷红成,张磊,董纯柱. 用于三维散射中心SBR建模的边缘绕射修正[J]. 西安电子科技大学学报, 2021, 48(2): 117-124. |
| [9] | 陶毅,丁丽. 利用共轭对称性的数字IQ频域校准方法[J]. 西安电子科技大学学报, 2021, 48(2): 181-189. |
| [10] | 段崇棣,韩超垒,杨志伟,张庆君. 一种杂波分类辅助的近海岸模糊杂波抑制方法[J]. 西安电子科技大学学报, 2021, 48(2): 64-71. |
| [11] | 胡利平,闫华,钟卫军,殷红成,王超. 舰船目标三维散射中心建模及SAR快速仿真方法[J]. 西安电子科技大学学报, 2021, 48(2): 72-83. |
| [12] | 杨军,周芳. 分布式小卫星MIMO-SAR超高分辨成像方法[J]. 西安电子科技大学学报, 2021, 48(2): 99-108. |
| [13] | 王志浩,张金松,孙光才,李军,邢孟道. 一种联合多特征量的SAR人造微弱痕迹检测方法[J]. 西安电子科技大学学报, 2019, 46(6): 131-139. |
| [14] | 贾宇峰,马力. 条件约束下的自我注意生成对抗网络[J]. 西安电子科技大学学报, 2019, 46(6): 163-171. |
| [15] | 杨磊,岳云泽,李埔丞,章涛,杨桓. 多地面运动目标大动态SAR成像稀疏表示[J]. 西安电子科技大学学报, 2019, 46(5): 31-40. |
|
||