[1] |
LOIZOU P C . Speech Enhancement: Theory and Practice[M]. Boca Raton: CRC Press, 2013.
|
[2] |
WANG D L, CHEN J . Supervised Speech Separation Based on Deep Learning: An Overview[J]. IEEE/ACM Transactions on Audio, Speech and Language Processing, 2018,26(10):1702-1726.
|
[3] |
GRAIS E M, SEN M U, ERDOGAN H . Deep Neural Networks for Single Channel Source Separation[C]//Proceedings of the 2014 IEEE International Conference on Acoustics, Speech and Signal Processing. Piscataway: IEEE, 2014: 3734-3738.
|
[4] |
WANG Y, NARAYANAN A, WANG D L . On Training Targets for Supervised Speech Separation[J]. IEEE/ACM Transactions on Audio, Speech and Language Processing, 2014,22(12):1849-1858.
|
[5] |
XU Y, DU J, DAI L R , et al. An Experimental Study on Speech Enhancement Based on Deep Neural Networks[J]. IEEE Signal Processing Letters, 2014,21(1):65-68.
|
[6] |
XU Y, DU J, DAI L R , et al. A Regression Approach to Speech Enhancement Based on Deep Neural Networks[J]. IEEE/ACM Transactions on Audio, Speech and Language Processing, 2015,23(1):7-19.
|
[7] |
CHEN J, WANG D . Long Short-term Memory for Speaker Generalization in Supervised Speech Separation[J]. Journal of the Acoustical Society of America, 2017,141(6):4705-4714.
|
[8] |
SUN L, DU J, DAI L R , et al. Multiple-target Deep Learning for LSTM-RNN Based Speech Enhancement[C]//Proceedings of the 2017 Hands-free Speech Communications and Microphone Arrays. Piscataway: IEEE, 2017: 136-140.
|
[9] |
LIU M, WANG Y, WANG J , et al. Speech Enhancement Method Based on LSTM Neural Network for Speech Recognition[C]//Proceedings of the 2019 International Conference on Signal Processing. Piscataway: IEEE, 2019: 245-249.
|
[10] |
PANDA A . Speech Enhancement Using RNN Models and Ideal Exponential Mask[C]//Proceedings of the 2018 IEEE Applied Signal Processing Conference. Piscataway: IEEE, 2018: 178-182.
|
[11] |
KOUNOVSKY T, MALEK J . Single Channel Speech Enhancement Using Convolutional Neural Network[C]//Proceedings of the 2017 IEEE International Workshop of Electronics, Control, Measurement, Signals and Their Application to Mechatronics. Piscataway: IEEE, 2017: 7945915.
|
[12] |
TAN K, CHEN J, WANG D . Gated Residual Networks with Dilated Convolutions for Monaural Speech Enhancement[J]. IEEE Transactions on Audio, Speech and Language Processing, 2019,27(1):189-198.
|
[13] |
OUYANG Z, YU H, ZHU W P , et al. A Fully Convolutional Neural Network for Complex Spectrogram Processing in Speech Enhancement[C]//Proceedings of the 2019 IEEE International Conference on Acoustics, Speech and Signal Processing. Piscataway: IEEE, 2019: 5756-5760.
|
[14] |
PASCUAL S, BONAFONTE A, SERRA J . SEGAN: Speech Enhancement Generative Adversarial Network[C]//Proceedings of the 2017 18th Annual Conference of the International Speech Communication Association: 2017. Baixas: International Speech Communication Association, 2017: 3642-3646.
|
[15] |
YUAN J, BAO C . Joint Ideal Ratio Mask and Generative Adversarial Networks for Monaural Speech Enhancement[C]//Proceedings of the 2019 International Conference on Signal Processing. Piscataway: IEEE, 2019: 276-280.
|
[16] |
XIANG Y, BAO C . Speech Enhancement via Generative Adversarial LSTM Networks[C]//Proceedings of the 2018 16th International Workshop on Acoustic Signal Enhancement. Piscataway: IEEE, 2018: 46-50.
|
[17] |
HAO X, SHAN C, XU Y , et al. An Attention-based Neural Network Approach for Single Channel Speech Enhancement[C]//Proceedings of the IEEE 2019 International Conference on Acoustics, Speech and Signal Processing. Piscataway: IEEE, 2019: 6895-6899.
|
[18] |
VASWANI A, SHAZEER N, PARMAR N , et al. Attention Is All You Need[C]//Advances in Neural Information Processing Systems 30: 2017. Vancouver: Neural Information Processing Systems Foundation, 2017: 5999-6009.
|
[19] |
袁文浩, 梁春燕, 娄迎曦 , 等. 一种时频平滑的深度神经网络语音增强方法[J]. 西安电子科技大学学报, 2019,46(4):130-136.
|
|
YUAN Wenhao, LIANG Chunyan, LOU Yingxi , et al. Speech Enhancement Method Based on the Time-frequency Smoothing Deep Neural Network[J]. Journal of Xidian University, 2019,46(4):130-136.
|
[20] |
袁文浩, 娄迎曦, 夏斌 , 等. 基于卷积门控循环神经网络的语音增强方法[J]. 华中科技大学学报(自然科学版), 2019,47(4):13-18.
|
|
YUAN Wenhao, LOU Yingxi, XIA Bin , et al. Speech Enhancement Method Based on Convolutional Gated Recurrent Neural Network[J]. Journal of Huazhong University of Science and Technology:Natural Science Edition, 2019,47(4):13-18.
|
[21] |
RIX A W, BEERENDS J G, HOLLIER M P , et al. Perceptual Evaluation of Speech Quality (PESQ)-a New Method for Speech Quality Assessment of Telephone Networks and Codecs[C]//Proceedings of the 2001 IEEE International Conference on Acoustics, Speech and Signal Processing. Piscataway: IEEE, 2001: 749-752.
|
[22] |
TAAL C H, HENDRIKS R C, HEUSDENS R , et al. An Algorithm for Intelligibility Prediction of Time-frequency Weighted Noisy Speech[J]. IEEE Transactions on Audio, Speech and Language Processing, 2011,19(7):2125-2136.
|