Journal of Xidian University ›› 2024, Vol. 51 ›› Issue (6): 25-39.doi: 10.19665/j.issn1001-2400.20240909

• Information and Communications Engineering • Previous Articles     Next Articles

Dual attention pedestrian detector for occlusion scenario based on feature calibration

TANG Shuyuan1,2,3,4(), ZHOU Yiqing1,2,3,4(), LI Jintao1,2,3,4(), LIU Chang1,2,4(), SHI Jinglin1,2,3,4()   

  1. 1. State Key Laboratory of Processors,Institute of Computing Technology,CAS,Beijing 100190,China
    2. Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China
    3. University of the Chinese Academy of Sciences,Beijing 100049,China
    4. Beijing Key Laboratory of Mobile Computing and Pervasive Device,Beijing 100190,China
  • Received:2024-01-06 Online:2024-12-20 Published:2024-10-08
  • Contact: ZHOU Yiqing E-mail:tangshuyuan20b@ict.ac.cn;zhouyiqing@ict.ac.cn;jtli@ict.ac.cn;liuchang@ict.ac.cn;sjl@ict.ac.cn

Abstract:

One of the major challenges faced by pedestrian detection technology based on computer vision is the issue of occlusion,including inter-class occlusion caused by objects in the natural environment and intra-class occlusion between pedestrians.These intertwined occlusion patterns limit the performance of pedestrian detectors.To address this problem,this paper proposes a dual-attention detection network based on feature calibration within the standard Faster R-CNN pedestrian detection framework.The network first generates attention masks through supervised learning to represent the spatial features of pedestrians in the image.These masks are then fused with backbone features and combined with a channel attention mechanism to calibrate pedestrian regions.This approach enhances the visibility of pedestrian regions while reducing the impact of occluded parts on classification and regression.Additionally,a non-uniform sampling strategy based on occlusion rates is introduced,targeting hard examples to allow the network to better learn complex occlusion patterns.Experimental results demonstrate that in comparison with standard pedestrian detectors,the proposed method achieves a 2.5% performance improvement on the reasonable occlusion subset of the CityPersons validation dataset.

Key words: convolutional neural network, pedestrian detection, dual attention mechanism, feature calibration, hard example mining, occlusion rate

CLC Number: 

  • TP391