Journal of Xidian University ›› 2021, Vol. 48 ›› Issue (2): 205-212. doi: 10.19665/j.issn1001-2400.2021.02.026

• Computer Science and Technology •



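  • About the authors: WEI Ziyu (born 1996), female, Ph.D. candidate at Xidian University, E-mail: zywei_xd@stu.xidian.edu.cn | WANG Nannan (born 1986), male, professor, E-mail: nnwang@xidian.edu.cn | YANG Dong (born 1988), male, senior engineer, E-mail: yangd504@126.com | GAO Xinbo (born 1972), male, professor, E-mail: gaoxb@cqupt.edu.cn, xbgao@mail.xidian.edu.cn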

Reciprocal bi-directional generative adversarial network for cross-modal pedestrian re-identification

WEI Ziyu1, YANG Xi1, WANG Nannan1, YANG Dong2, GAO Xinbo3

  1. School of Telecommunications Engineering, Xidian University, Xi’an 710071, China
    2. Xi’an Institute of Space Radio Technology, Xi’an 710100, China
    3. Chongqing Key Laboratory of Image Cognition, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
  • Received: 2020-06-12 Revised: 2020-01-06 Online: 2021-04-20 Published: 2021-04-28
  • Contact: Xi YANG





To improve the accuracy of cross-modal pedestrian re-identification, a method based on a reciprocal bi-directional generative adversarial network is proposed. First, two generative adversarial networks are built to generate cross-modal heterogeneous images. Second, an associated loss is designed to pull the feature distributions in latent space closer together during image translation between visible and infrared images, which helps the networks generate fake heterogeneous images that closely resemble the real images. Finally, the original and generated heterogeneous pedestrian images are concatenated and fed into a discriminative feature extraction network, so that images from different modalities are unified into a common modality and the cross-modal gap is suppressed. Representation learning and metric learning are used to obtain more discriminative pedestrian features. Comparative experiments on the SYSU-MM01 and RegDB datasets analyze the accuracy obtained with different loss functions. Compared with other state-of-the-art cross-modal pedestrian re-identification methods, the proposed method achieves higher accuracy and stronger robustness.
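The abstract does not give the form of the associated loss or the metric-learning objective, so the following is only a minimal sketch under stated assumptions. It treats the associated loss as the squared Euclidean distance between the batch means of the latent features from the two translation directions, and uses a standard triplet loss as one common metric-learning choice. The names `batch_mean`, `latent_alignment_loss`, and `triplet_loss`, and the margin value, are hypothetical and not taken from the paper.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def batch_mean(features: Sequence[Vector]) -> List[float]:
    """Element-wise mean of a non-empty batch of equal-length feature vectors."""
    if not features:
        raise ValueError("features must be non-empty")
    dim = len(features[0])
    if any(len(f) != dim for f in features):
        raise ValueError("all feature vectors must have the same length")
    n = len(features)
    return [sum(f[i] for f in features) / n for i in range(dim)]


def latent_alignment_loss(visible_latents: Sequence[Vector],
                          infrared_latents: Sequence[Vector]) -> float:
    """ASSUMED stand-in for the associated loss: squared Euclidean distance
    between the mean latent features of the two translation directions."""
    mu_v = batch_mean(visible_latents)
    mu_i = batch_mean(infrared_latents)
    if len(mu_v) != len(mu_i):
        raise ValueError("latent dimensions of the two modalities differ")
    return sum((a - b) ** 2 for a, b in zip(mu_v, mu_i))


def triplet_loss(anchor: Vector, positive: Vector, negative: Vector,
                 margin: float = 0.3) -> float:
    """Hinge triplet loss on Euclidean distances (margin is a hypothetical value)."""
    d_ap = math.dist(anchor, positive)  # anchor to same-identity sample
    d_an = math.dist(anchor, negative)  # anchor to different-identity sample
    return max(0.0, d_ap - d_an + margin)


if __name__ == "__main__":
    # Toy 2-D latent features; real features would come from the generators' encoders.
    visible = [[1.0, 2.0], [3.0, 4.0]]
    infrared = [[2.0, 3.0], [2.0, 5.0]]
    print(latent_alignment_loss(visible, infrared))
    print(triplet_loss([0.0, 0.0], [0.0, 1.0], [3.0, 4.0]))
```

In this sketch, a smaller alignment loss means the two latent feature batches have closer means, which is one simple way to read "pulling the distributions closer"; matching only the means is a deliberate simplification and a real implementation might compare the distributions more fully.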

Key words: generative adversarial networks, image translation, feature extraction, cross-modal pedestrian re-identification


  • CLC number: TN911.73