J4, 2015, Vol. 42, Issue (3): 135-140+191. doi: 10.3969/j.issn.1001-2400.2015.03.023

• Original Articles •

Implementation and optimization of the wideband matched filter on the GPU

ZHOU Hang; CAI Zhiming; WANG Ximin

  1. (School of Electronic Engineering, Naval Univ. of Engineering, Wuhan 430033, China)
  • Received: 2014-05-04 Revised: 2014-05-28 Online: 2015-06-20 Published: 2015-07-27
  • Contact: ZHOU Hang, E-mail: zhh06@163.com

Abstract:

Fine estimation of the wideband ambiguity function, which has a sharp main ridge, requires a dense search over the time-scale plane. This demands well-optimized software running on high-performance hardware. For wideband correlation, the matched filter based on the continuous wavelet transform (CWT) and its fast algorithm based on the FFT are studied, and the computational complexity is analyzed. A reconfigurable implementation on the GPU is then proposed, together with an optimization method that combines analysis with testing. Optimizing the dimensions of the thread block and using texture memory reduces the kernel execution time, and introducing the CUDA library reduces the latency of the FFT and of the maximum search. Compared with the method on an 8-core CPU, the proposed method improves overall performance by up to 3.3 times. This speed is sufficient for real-time processing with the wideband matched filter.

Key words: signal processing, parallel computing, graphics processing unit (GPU), program optimization, continuous wavelet transform (CWT)
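
As a rough illustration of the time-scale search described in the abstract, the following is a minimal Python sketch and not the authors' GPU implementation. It evaluates the correlation directly in the time domain rather than with the FFT-based fast algorithm, and it runs on the CPU. The function names `stretch` and `wideband_matched_filter`, the linear-interpolation resampling, and the energy normalisation are all assumptions made for illustration.

```python
import math


def stretch(ref, scale):
    """Return a time-scaled, unit-energy copy of ref (hypothetical helper).

    Sample n of the output is ref evaluated at time n * scale, using
    linear interpolation, so scale > 1 compresses the waveform.
    """
    n_out = int(math.floor((len(ref) - 1) / scale)) + 1
    out = []
    for n in range(n_out):
        t = n * scale
        i = int(math.floor(t))
        frac = t - i
        if i + 1 < len(ref):
            out.append((1.0 - frac) * ref[i] + frac * ref[i + 1])
        else:
            out.append(ref[i])  # last sample: nothing to interpolate with
    energy = math.sqrt(sum(v * v for v in out))
    return [v / energy for v in out] if energy > 0 else out


def wideband_matched_filter(received, ref, scales):
    """Search a (scale, delay) grid for the largest correlation magnitude.

    Returns (peak, scale, delay). Each scale is processed independently
    of the others.
    """
    best = (0.0, None, None)
    for s in scales:
        template = stretch(ref, s)
        for d in range(len(received) - len(template) + 1):
            c = sum(received[d + k] * template[k]
                    for k in range(len(template)))
            if abs(c) > best[0]:
                best = (abs(c), s, d)
    return best


if __name__ == "__main__":
    # Assumed test signal: a short chirp, compressed by 1.1 and delayed by 37.
    ref = [math.sin(2 * math.pi * (0.05 * n + 0.001 * n * n)) for n in range(64)]
    echo = stretch(ref, 1.1)
    received = [0.0] * 200
    for k, v in enumerate(echo):
        received[37 + k] = v
    print(wideband_matched_filter(received, ref, [0.9, 1.0, 1.1]))
```

In this sketch every scale, and every delay within a scale, is computed independently of the others, which is the property that makes such a search a natural candidate for parallel hardware. The direct inner loop costs one multiply-add per template sample per delay; the FFT-based algorithm mentioned in the abstract replaces that loop but is not shown here.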