[1] 胡波, 肖熙. 检测语音端点及基音的概率模型及方法[J]. 清华大学学报(自然科学版), 2013, 53(6): 749-752.
Hu Bo, Xiao Xi. Endpoint Detection and Pitch Determination Method Based on a Probability Model [J]. Journal of Tsinghua University(Science and Technology), 2013, 53(6): 749-752.
[2] 李远征, 卢朝阳, 李静. 一种基于多特征融合的视频目标跟踪方法[J]. 西安电子科技大学学报, 2012, 39(4): 624-629.
Li Yuanzheng, Lu Chaoyang, Li Jing. A Robust Video Object Tracking Algorithm Based on Multi-feature Fusion [J]. Journal of Xidian University, 2012, 39(4): 624-629.
[3] Ghosh P K, Tsiartas A, Narayanan S. Robust Voice Activity Detection Using Long-Term Signal Variability [J]. IEEE Transactions on Audio, Speech and Language Processing, 2011, 19(3): 601-613.
[4] Cheng Gong, Zhang Xiongwei, Li Yaobo, et al. Voice Activity Detection Method Based on Gray Correlation Analysis Algorithm [J]. Journal of PLA University of Science and Technology, 2012, 56(7): 1014-1022.
[5] Cho N, Kim E K. Enhanced Voice Activity Detection Using Acoustic Event Detection and Classification [J]. IEEE Transactions on Consumer Electronics, 2011, 57(1): 196-202
[6] Chiu Y H B, Raj B, Stern R M. Learning-Based Auditory Encoding for Robust Speech Recognition [J]. IEEE Transactions on Audio, Speech and Language Processing, 2012, 20(3): 900-914. |