西安电子科技大学学报 ›› 2025, Vol. 52 ›› Issue (2): 1-12.doi: 10.19665/j.issn1001-2400.20241201

• •    下一篇

针对深度神经网络的高效光学对抗攻击

戚富琪(), 高海昌(), 李博凌(), 邹翔()   

  1. 西安电子科技大学 计算机科学与技术学院,陕西 西安 710071

Effective adversarial optical attacks on deep neural networks

QI Fuqi(), GAO Haichang(), LI Boling(), ZOU Xiang()   

  1. School of Computer Science and Technology,Xidian University,Xi’an 710071,China
  • Received:2024-07-10 Online:2025-04-20 Published:2024-12-17

摘要:

随着对抗攻击算法的不断更新,深度神经网络面临的安全风险愈加严峻。由于光学现象在真实世界中出现频繁,对光学对抗攻击的抗干扰能力直观反应了深度神经网络在实际应用中的安全性。然而,目前光学对抗攻击方面的研究普遍存在光学对抗扰动失真和优化不稳定的问题。为此,提出了一种新型光学攻击方法AdvFlare,以便于探究眩光扰动对深度神经网络安全性的影响。AdvFlare构造了一种参数化的眩光仿真模型,该模型对眩光的形状和颜色等多个属性进行建模,仿真效果好。在此基础上,提出了参数空间限制、随机初始化和分步优化的策略,解决了对抗扰动失真与收敛困难的问题。实验结果表明,与现有方法相比,AdvFlare能够以极高的成功率让深度神经网络误分类,具有稳定和扰动逼真度高的优点。此外,还发现,无论在数字域还是物理域,利用AdvFlare进行对抗训练能够显著提高深度神经网络的抗干扰能力,对提高公共交通场景下的模型鲁棒性有启发作用。

关键词: 深度神经网络, 对抗攻击, 眩光效应, 模型鲁棒性, 对抗训练

Abstract:

With the continuous advancement of adversarial attack algorithms,the security risks that deep neural networks face are increasingly severe.Optical phenomena frequently occur in real-world scenarios,and the robustness against optical adversarial attacks directly reflects the safety of deep neural networks.Nevertheless,current research on optical adversarial attacks commonly encounters challenges such as optical perturbation distortion and optimization instability.To solve this problem,this paper proposes a novel optical attack method named AdvFlare,to help explore the effect of flare perturbations on the safety of deep neural networks.AdvFlare constructs a parameterized flare simulation model,which models the multiple attributes of the flare pattern,such as shape and color,with great simulation,on the basis of which this paper addresses the problems of adversarial perturbation distortion and convergence difficulties through strategies such as parameter space constraints,random initialization,and stepwise optimization.Experimental results indicate that AdvFlare can induce misclassification in deep neural networks with a significantly higher success rate compared to existing methods,while also offering a superior visual perturbation quality and stability.Furthermore,it is discovered that adversarial training using AdvFlare can markedly enhance the robustness of deep neural networks,in both the digital and physical world,providing valuable insights for improving model robustness in public transportation contexts.

Key words: deep neural networks, adversarial attacks, flare effect, model robustness, adversarial training

中图分类号: 

  • TP183