Journal of Xidian University ›› 2022, Vol. 49 ›› Issue (6): 76-85.doi: 10.19665/j.issn1001-2400.2022.06.010

• Computer Science and Technology & Artificial Intelligence • Previous Articles     Next Articles

Online classification jointed RGBT tracking based on the dual attention Siamese network

ZHANG Zhaoyu1(),TIAN Chunna1(),ZHOU Heng1(),TIAN Xilan2()   

  1. 1. School of Electronic Engineering,Xidian University,Xi’an 710071,China
    2. Digital Technology R & D Center,the 38th Research Institute of China Electronics Technology Group Corporation,Hefei 230088,China
  • Received:2021-10-27 Online:2022-12-20 Published:2023-02-09

Abstract:

The imaging mechanism of visible light and that of thermal infrared are different.Visible light and thermal infrared images reflect different information on the object.A dual-modal visual tracker based on visible light and thermal infrared sequences can comprehensively utilize the inherent correlation and complementarity of two modals,which reduces limitations and uncertainties of single-modal information,and improves the robustness of the visual tracking system.We propose an end-to-end dual-modal tracking algorithm with the Siamese network based on infrared and visible light sequences.The network learns the depth features from the visible light and thermal infrared frames at the same time,and then adaptively fuses the two-model features through intra-modal and cross-modal dual attention mechanisms,which leads to more robust tracking.At the same time,in view of the insufficiency of the Siamese network in distinguishing the target and semantic background,we incorporate the online classification module into the tracking framework.The online learned classifier reduces the interference and adapts to the target changes during tracking.According to experimental results,the proposed algorithm effectively improves the performance of the tracker.Its precision rate and success rate are 90.6% and 73.8% on the RGBT benchmark dataset GTOT,which are 5.5% and 4.3% higher than those of the baseline algorithm.The overall performance is better than that of other advanced tracking algorithms.

Key words: object tracking, RGB/Thermal infrared, Siamese network, attention mechanism, deep learning

CLC Number: 

  • TP391