Electronic Science and Technology ›› 2024, Vol. 37 ›› Issue (4): 77-86.doi: 10.16180/j.cnki.issn1007-7820.2024.04.011

Previous Articles     Next Articles

Research on Fast 3D Hand Keypoint Detection Algorithm Based on Anchor

QIN Xiaofei, HE Wen, BAN Dongxian, GUO Hongyu, YU Jing   

  1. School of Optical-Electrical and Computer Engineering,University of Shanghai for Science and Technology,Shanghai 200093,China
  • Received:2022-12-17 Online:2024-04-15 Published:2024-04-19
  • Supported by:
    National Natural Science Foundation of China(92048205);China Scholarship(202008310014)

Abstract:

In human-robotcollaboration tasks, hand key point detection provides target point coordinates for the robotic arm.A2J(Anchor-to-Joint) is a representative method of key point detection using anchor points.A2J can achieve better detection effect with depth map input, but it has insufficient ability to acquire global features.In this study, a GLF(Global-Local Feature Fusion) module is designed to fuse the shallow and deep features of the backbone network.In order to improve the detection speed, the backbone network of A2J is replaced with ShuffleNetv2 and reformed, and 3×3 depth separable convolution is replaced with 5×5 depth separable convolution to increase the sensitivity field and effectively improve the backbone network's ability to extract global features.ECA(Efficient Channel Attention) is introduced into the anchor weight estimation branch to improve the network's attention to important anchor points.The results of training and testing on the mainstream data sets ICVL and NYU show that the average error of the proposed method is reduced by 0.09 mm and 0.15 mm, respectively, compared with A2J.The detection rate of 151 frame·s-1 is realized on GTX1080Ti graphics card, which fully meets the real-time requirement of man-machine collaboration task.

Key words: human-robot collaboration, 3D hand keypoint detection, anchor point, depth map, global-local feature fusion, ShuffleNetv2, depthwise separable convolution, efficient channel attention

CLC Number: 

  • TP391.41