J4 ›› 2014, Vol. 41 ›› Issue (6): 155-159+194.doi: 10.3969/j.issn.1001-2400.2014.06.026

• Original Articles • Previous Articles     Next Articles

Speech enhancement method using self-adaptive time-shift and threshold discrete cosine transform

ZHANG Junchang;LIU Haipeng;FAN Yangyu   

  1.  (School of Electronic Information, Northwestern Polytechnical University, Xi'an  710129, China)
  • Received:2013-10-24 Online:2014-12-20 Published:2015-01-19
  • Contact: ZHANG Junchang E-mail:zhangjc@nwpu.edu.cn

Abstract:

In view of the limitation of the existing speech enhancement methods under a low SNR, this paper proposes a speech enhancement method using self-adaptive time-shift and threshold discrete cosine transform. First, with the improved soft-threshold function to deal with the discrete cosine transform coefficients, we can not only eliminate the noise of noise-dominant frames, but also eliminate the noise of signal-dominant frames; the threshold is also selected self-adaptively based on the SNR, which can largely retain the original characteristics of the speech.Secondly, the shift of the analysis window is self-adapted according to the pitch period, reducing the white noise of the fixed window-shift.And a weighted autocorrelation function is introduced for pitch detection combined by the short-time autocorrelation function and the short-time average magnitude separation function, improving the precision of pitch detection and robustness to noise. Theoretical analysis and simulation results show that the output SNR of this method has increased greatly and the robustness to noise is better when the input SNR is as low as -5dB, compared with the empirical mode decomposition algorithm and the subspace algorithm.

Key words: speech enhancement, discrete cosine transform, adaptive threshold, adaptive time-shift, weighted autocorrelation function