Journal of Xidian University ›› 2019, Vol. 46 ›› Issue (1): 86-92.doi: 10.19665/j.issn1001-2400.2019.01.014

Previous Articles     Next Articles

CFCC feature extraction for fusion of the power-law nonlinearity function and spectral subtraction

BAI Jing,SHI Yanyan(),XUE Peiyun,GUO Qianyan   

  1. College of Information and Computer, Taiyuan University of Technology, Taiyuan 030024, China
  • Received:2018-06-26 Online:2019-02-20 Published:2019-03-05
  • Contact: Yanyan SHI E-mail:690742874@qq.com

Abstract:

This paper presents an improved speech feature extraction algorithm for improving the accuracy of speech recognition in noisy environment. A New Cochlear Filter Cepstral Coefficient(NCFCC) is extracted by the power-law nonlinear function which can simulate the auditory characteristics of the human ear. Then, the spectral subtraction is introduced in the feature extraction front end to enhance the signal, and the new feature and the first order difference are composed of a mixed feature parameter, after which the combined principal component analysis is made to reduce the dimension of the hybrid feature. The final feature is used in a non-specific persons, isolated words, and small-vocabulary speech recognition system. Experimental results show that, compared with the traditional Cochlear Filter Cepstral Coefficients(CFCC) feature, the Cochlear Filter Cepstral Coefficients extracted from the power-law nonlinear function significantly improve the accuracy of speech recognition. The mixed feature parameter can achieve a better speech recognition performance than a single feature. Combined with the feature set of the principal component analysis(PCA) ,the recognition accuracy can reach up to 88.10% when the signal to noise ratio(SNR) is 0 dB.

Key words: speech recognition, power-law nonlinearity function, cochlear filter cepstral coefficients, spectral subtraction

CLC Number: 

  • TN912.34