Journal of Xidian University ›› 2019, Vol. 46 ›› Issue (4): 130-136.doi: 10.19665/j.issn1001-2400.2019.04.018

Previous Articles     Next Articles

Speech enhancement method based on the time-frequency smoothing deep neural network

YUAN Wenhao,LIANG Chunyan,LOU Yingxi,FANG Chao,WANG Zhiqiang   

  1. School of Computer Science and Technology, Shandong University of Technology, Zibo 255000, China
  • Received:2019-03-22 Online:2019-08-20 Published:2019-08-15

Abstract:

In the existing speech enhancement methods based on the deep neural network, the characteristics of speech enhancement problem are not fully considered in the design of the network structure. In view of this problem, based on the different characteristics of speech enhancement in time and frequency, inspired by the feature calculation method in the traditional speech enhancement methods, a time-frequency smoothing network with different processings in time and frequency is designed. In this network, a gated recurrent unit is used to express the correlation of noisy speech with time, and a convolutional neural network is used to express the correlation of the noisy speech with frequency, which realizes a time-frequency smoothing process similar to that of the traditional speech enhancement methods. Experimental results show that the proposed time-frequency smoothing network can significantly improve the speech enhancement performance compared with other networks under the premise of ensuring the causality of the speech enhancement system and that the enhanced speech has a better speech quality and intelligibility.

Key words: speech enhancement, time-frequency smoothing, convolutional neural network, deep neural network

CLC Number: 

  • TN912.3