[1] NASERSHARIF B, AKBARI A. SNR-dependent Compression of Enhanced Mel Subband Energies for Compensation of Noise Effects on MFCC Features [J]. Pattern Recognition Letters, 2011, 28(11): 1320-1326.
[2] POVEY D, KINGSBURY B, MANGU L, et al. fMPE: Discriminatively Trained Features for Speech Recognition[C]//Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. Piscataway: IEEE, 2005: 961-964.
[3] ZHANG B, MATSOUKAS S, SCHWARTZ R. Recent Progress on the Discriminative Region-dependent Transform for Speech Feature Extraction[C]//Proceedings of the 9th International Conference on Spoken Language Processing. Baixas: ISCA, 2006: 1495-1498.
[4] FUKUDA T, ICHIKAWA O, NISHIMURA M, et al. Regularized Feature-space Discriminative Adaptation for Robust ASR[C]//Proceedings of the Annual Conference of the International Speech Communication Association. Baixas: ISCA, 2014: 2185-2188.
[5] POVEY D. Improvements to fMPE for Discriminative Training of Features[C]//Proceedings of the Annual Conference of the International Speech Communication Association. Baixas: ISCA, 2005: 2977-2980.
[6] KARAFIAT M, JANDA M, CERNOCKY J, et al. Region Dependent Linear Transforms in Multilingual Speech Recognition[C]//Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. Piscataway: IEEE, 2012: 4885-4888.
[7] DENG L, CHEN J S. Sequence Classification Using the High-Level Features Extracted from Deep Neural Networks [C]//Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. Piscataway: IEEE, 2014: 6844-6898.
[8] LIU D Y, WEI S, GUO W, et al. Lattice Based Optimization of Bottleneck Feature Extractor with Linear Transformation[C]//Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. Piscataway: IEEE, 2014: 5617-5621.
[9] YAN Z J, HUO Q, XU J, et al. Tied-state Based Discriminative Training of Context-expanded Region-dependent Feature Transforms for LVCSR[C]//Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. Piscataway: IEEE, 2013:6940-6944.
[10] ZIBULEVSKY M. L1-L2 Optimization in Signal and Image Processing [J]. IEEE Signal Processing Magazine, 2010, 27(3): 76-88.
[11] EMRE Y, JORT F G, HUGO V H. Noise Robust Exemplar Matching Using Sparse Representations of Speech [J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2014, 22(8): 1306-1319.
[12] ZHANG W B, FUNG P. Discriminatively Trained Sparse Inverse Covariance Matrices for Speech Recognition [J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2014, 22(5): 873-882.
[13] LU L, GHOSHAL A, RENALS S. Regularized Subspace Gaussian Mixture Models for Speech Recognition [J]. IEEE Signal Processing Letters, 2011, 18(7): 419-422.
[14] YU Z, EKAPOL C, JAMES G. Extracting Deep Neural Network Bottleneck Features Using Low-rank Matrix Factorization[C]//Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. Piscataway: IEEE, 2014: 185-189.
[15] BECK A, TEBOULLE M. A Fast Iterative Shrinkage-thresholding Algorithm for Linear Inverse Problems [J]. SIAM Journal on Imaging Sciences, 2009, 2(1): 183-202. |