[1] RABINER L R, SAMBUR M R. Voiced-unvoiced-silence Detection Using the Itakura LPC Distance Measure[C]//Proceedings of the IEEE Acoustics, Speech, and Signal Processing. New York: IEEE, 1977: 323-326.
[2] BENGIO Y. Learning Deep Architectures for AI[J]. Foundations and Trends in Machine Learning, 2009, 2(1): 1-127.
[3] LI D. A Tutorial Survey of Architectures, Algorithms, and Applications for Deep Learning[J]. APSIPA Transactions on Signal and Information Processing, 2015, 3: e2.
[4] VINCENT P, LAROCHELLE H, BENGIO Y, et al. Extracting and Composing Robust Features with Denoising Autoencoders[C]//Proceedings of the 25th International Conference on Machine Learning. New York: ACM, 2008: 1096-1103.
[5] BENGIO Y, LAMBLIN P, POPOVICI D, et al. Greedy Layer-wise Training of Deep Networks[C]//Advances in Neural Information Processing Systems. Canada: Neural Information Processing System Foundation, 2007: 153-160.
[6] DENG J, ZHANG Z, MARCHI E, et al. Sparse Autoencoder-based Feature Transfer Learning for Speech Emotion Recognition[C]//Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction. Washington: IEEE Computer Society, 2013: 511-516.
[7] HINTON G E, OSINDERO S, TEH Y W. A Fast Learning Algorithm for Deep Belief Nets[J]. Neural Computation, 2006, 18(7): 1527-1554.
[8] NG A. Sparse Autoencoder[J]. CS294A Lecture Notes, 2011, 72: 1-19.
[9] TIKHONOV A N. Solution of Incorrectly Formulated Problems and the Regularization Method[J]. Soviet Math Dokl, 1962, 5: 1035-1038.
[10] TANG K W, SKORIN-KAPOV J. Training Artificial Neural Networks: Backpropagation via Nonlinear Optimization[J]. Cit Journal of Computing & Information Technology, 2001, 9(1): 1-14.
[11] BARNARD E. Optimization for Training Neural Nets[J]. IEEE Transactions on Neural Networks, 1992, 3(2): 232-240.
[12] SUPPLEE L M, COHN R P, COLLURA J S, et al. MELP: the New Federal Standard at 2400bps[C]//Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing: 2. Piscataway: IEEE, 1997: 1591-1594.
[13] GAROFOLO J S, LAMEL L F, FISHER W M, et al. DARPA TIMIT Acoustic-phonetic Continous Speech Corpus CD-ROM. NIST Speech Disc 1-1. 1: NASA STI/Recon Technical Report N 93, 27403[R]. Washington: NASA, 1993. |