Electronic Science and Technology ›› 2024, Vol. 37 ›› Issue (10): 64-70.doi: 10.16180/j.cnki.issn1007-7820.2024.10.009

Previous Articles     Next Articles

Research on Action Recognition Method Based on Deep Learning

XIN Tenghao, LI Feifei   

  1. School of Optical-Electrical and Computer Engineering,University of Shanghai for Science and Technology, Shanghai 200093,China
  • Received:2023-03-14 Online:2024-10-15 Published:2024-11-04
  • Supported by:
    The Program for Professor of Special Appointment(Eastern Scholar) at Shanghai Institutions of Higher Learning(ES2015XX)

Abstract:

The key of current research on behavior recognition algorithms based on deep learning lies in enhancing the accuracy and stability of key point extraction, in order to achieve more accurate action recognition of targets. However, many current algorithms tend to just add attention mechanisms that appear to perform better in the feature extraction stage of the target, without considering the impact of different attention mechanisms on different models and tasks. Therefore, this study proposes an algorithmic model for pose estimation based on various attention mechanisms, which further highlights the importance of selecting an appropriate attention mechanism by comparing the impact of different attention mechanisms on the model. In addition, considering the stability of key point extraction, the initialization of the model is fine-tuned to select a more suitable initialization method that improves the performance by increasing the category of weights on network layer judgments. Compared with the performance of the benchmark network model, the model enhances all evaluation metrics on both multiscale and no-multiscale CrowdPose datasets, where the average accuracy improvement in both cases is more than 1%.

Key words: behavior recognition, pose estimation, computer vision, graph convolution network, key points, HRNet, attention mechanism, average precision

CLC Number: 

  • TP391