Journal of Xidian University ›› 2020, Vol. 47 ›› Issue (2): 16-22.doi: 10.19665/j.issn1001-2400.2020.02.003
Previous Articles Next Articles
WANG Xiaohua,YAO Pengchao,MA Liping,WANG Wenjie,ZHANG Lei
Received:
2019-08-02
Online:
2020-04-20
Published:
2020-04-26
CLC Number:
WANG Xiaohua,YAO Pengchao,MA Liping,WANG Wenjie,ZHANG Lei. Algorithm for extraction of features of robot speech control in the factory environment[J].Journal of Xidian University, 2020, 47(2): 16-22.
"
噪声 类型 | 特征提 取算法 | 信噪比/dB | |||||
---|---|---|---|---|---|---|---|
-5 | 0 | 5 | 10 | 15 | 20 | ||
白噪声 | MFCC | 35.25 | 41.24 | 62.05 | 71.71 | 80.24 | 85.25 |
GFCC | 40.25 | 48.35 | 64.25 | 76.25 | 85.65 | 87.75 | |
TEOGFCC | 48.81 | 65.21 | 74.24 | 80.20 | 87.21 | 90.20 | |
TEOGFCC+ΔTEOGFCC | 50.64 | 67.32 | 77.58 | 84.38 | 89.01 | 91.14 | |
粉红噪声 | MFCC | 36.70 | 42.13 | 63.08 | 72.31 | 81.25 | 86.69 |
GFCC | 41.25 | 49.05 | 66.21 | 75.65 | 84.36 | 88.82 | |
TEOGFCC | 49.25 | 66.32 | 74.06 | 80.35 | 88.31 | 90.05 | |
TEOGFCC+ΔTEOGFCC | 51.88 | 68.24 | 77.86 | 85.63 | 90.32 | 91.78 | |
汽车噪声 | MFCC | 34.24 | 42.25 | 63.58 | 71.12 | 81.33 | 86.96 |
GFCC | 43.21 | 50.26 | 68.32 | 76.30 | 85.65 | 87.24 | |
TEOGFCC | 47.65 | 63.45 | 73.26 | 80.09 | 87.87 | 90.74 | |
TEOGFCC+ΔTEOGFCC | 50.24 | 67.69 | 78.35 | 84.25 | 89.81 | 91.65 | |
车间噪声 | MFCC | 35.58 | 43.28 | 65.36 | 73.25 | 82.16 | 85.23 |
GFCC | 42.32 | 51.25 | 67.48 | 75.91 | 86.51 | 87.24 | |
TEOGFCC | 48.04 | 61.68 | 72.55 | 81.48 | 86.61 | 91.34 | |
TEOGFCC+ΔTEOGFCC | 51.64 | 68.28 | 79.85 | 85.45 | 90.81 | 92.08 |
[1] | ISHI C T, ISHIGURO H, HAGITA N . Analysis of Head Motions and Speech, and Head Motion Control in an Android Robot[C]// Geminoid Studies: Science and Technologies for Humanlike Teleoperated Androids. Singapore: Springer Singapore, 2018: 89-110. |
[2] | PIRES J N, AZAR A S . Advances in Robotics for Additive/Hybrid Manufacturing: Robot Control, Speech Interface and Path Planning[J]. Industrial Robot, 2018,45(3):311-327. |
[3] | MAGASSOUBA A, BERTIN N, CHAUMETTE F . Exploiting the Distance Information of the Interaural Level Difference for Binaural Robot Motion Control[J]. IEEE Robotics and Automation Letters, 2018,3(3):2048-2055. |
[4] | KUMAR A, ROUT S S, GOEL V . Speech Mel Frequency Cepstral Coefficient Feature Classification Using Multi Level Support Vector Machine[C]// Proceedings of the 2017 4th IEEE Uttar Pradesh Section International Conference on Electrical, Computer and Electronics. Piscataway: IEEE, 2017: 134-138. |
[5] | SHAHIN I . Emotion Recognition Based on Third-order Circular Suprasegmental Hidden Markov Model[C]// Proceedings of the 2019 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology. Piscataway: IEEE, 2019: 800-805. |
[6] | ISHIMITSU S . Speech Recognition Method and Speech Recognition Apparatus: US6067513A[P]. 2000 -05-23. |
[7] | TAZI E B . A Robust Speaker Identification System Based on the Combination of GFCC and MFCC Methods[C]// Proceedings of the 2017 International Conference on Multimedia Computing and Systems. Washington: IEEE Computer Society, 2017: 54-58. |
[8] | CAN G, AKBAS C E, CETIN A E . Recognition of Vessel Acoustic Signatures Using Non-linear Teager Energy Based Features[C]// Proceedings of the 2016 International Workshop on Computational Intelligence for Multimedia Understanding. Piscataway: IEEE, 2016: 7801190. |
[9] | 白静, 史燕燕, 薛珮芸 , 等. 融合非线性幂函数和谱减法的CFCC特征提取[J]. 西安电子科技大学学报, 2019,46(1):86-92. |
BAI Jing, SHI Yanyan, XUE Peiyun , et al. CFCC Feature Extraction for Fusion of the Power-law Nonlinearity Function and Spectral Subtraction[J]. Journal of Xidian University, 2019,46(1):86-92. | |
[10] | MARKOVIĆ B, GALIĆ J, GROZDIĆ D , et al. Whispered Speech Recognition Based on Gammatone Filterbank Cepstral Coefficients[J]. Journal of Communications Technology and Electronics, 2017,62(11):1255-1261. |
[11] | BANDELA S R, KUMAR T K . Stressed Speech Emotion Recognition Using Feature Fusion of Teager Energy Operator and MFCC[C]// Proceedings of the 2017 8th International Conference on Computing, Communications and Networking Technologies. Piscataway: IEEE, 2017: 8204149. |
[12] | MONTEIRO R L D C, PEREIRA V, COSTA H G . Analysis of the Better Life Index trough a Cluster Algorithm[J]. Social Indicators Research, 2019,142(2):477-506. |
[13] | TONG Z, SHI D, YANG S . Scene SLAM: a SLAM Framework Combined with Scene Detection[C]// Proceedings of the 2017 IEEE International Conference on Robotics and Biomimetics. Piscataway: IEEE, 2017: 1-8. |
[14] | KORSHUNOV P, GONÇALVES A R, VIOLATO R P V , et al. On the Use of Convolutional Neural Networks for Speech Presentation Attack Detection[C]// Proceedings of the 2018 IEEE 4th International Conference on Identity, Security, and Behavior Analysis. Piscataway: IEEE, 2018: 1-8. |
[15] | 张雪英, 牛溥华, 高帆 . 基于DNN-LSTM的VAD算法[J]. 清华大学学报(自然科学版), 2018,58(5):509-515. |
ZHANG Xueying, NIU Puhua, GAO Fan . VAD Algorithm Based on DNN-LSTM[J]. Journal of Tsinghua University (Science and Technology), 2018,58(5):509-515. |
[1] | LIU Jieyi,GONG Maoguo,ZHAN Tao,LI Hao,ZHANG Mingyang. Method for discrimination of false targets in multistation radar systems based on the deep neural network [J]. Journal of Xidian University, 2021, 48(2): 133-138. |
[2] | WEI Ziyu,YANG Xi,WANG Nannan,YANG Dong,GAO Xinbo. Reciprocal bi-directional generative adversarial network for cross-modal pedestrian re-identification [J]. Journal of Xidian University, 2021, 48(2): 205-212. |
[3] | GUO Zekun,TIAN Long,HAN Ning,WANG Penghui,LIU Hongwei,CHEN Bo. Radar HRRP based few-shot target recognition with CNN-SSD [J]. Journal of Xidian University, 2021, 48(2): 7-14. |
[4] | HAN Zhuoxi,WANG Feng,CHEN Pei,LI Zhuolun. Fuzzy data association algorithm assisted by historical features [J]. Journal of Xidian University, 2021, 48(2): 92-98. |
[5] | CHENG Tianyou,LIN Yanping,MA Xiaojun. Ultrasound probe guiding method using vision and force [J]. Journal of Xidian University, 2020, 47(1): 80-87. |
[6] | WANG Hongyan,ZHU Limin,ZHANG Panjie,LI Jinping. Method for the detection of the piston side defect based on external contour registration [J]. Journal of Xidian University, 2019, 46(5): 75-83. |
[7] | GUO Rui,ZANG Bo,PENG Shuming,XING Mengdao. Extraction of features of the urban high-rise building from high resolution InSAR data [J]. Journal of Xidian University, 2019, 46(4): 137-143. |
[8] | LIU Daohua,CUI Yushuang,ZHAO Yansong,SONG Yuting,WANG Jinghui. Method for retrieving the teaching image based on the improved convolutional neural network [J]. Journal of Xidian University, 2019, 46(3): 52-58. |
[9] | YANG Jiangong,WANG Xili,LIU Shigang. Spectral-spatial classification of hyperspectral images using deep Boltzmann machines [J]. Journal of Xidian University, 2019, 46(3): 109-115. |
[10] | WANG Xinyuan,XIAO Song,LI Lei,JIAO Lingling. Robust target tracking algorithm based on the ELM and discriminative correlation filter [J]. Journal of Xidian University, 2019, 46(1): 57-63. |
[11] | WANG Xing;HUANG Xiaoyu;LIU Xuanpu;KONG Xianguang;NIU Meng. Multi-layer incremental feature extraction method for industrial big data [J]. Journal of Xidian University, 2018, 45(4): 106-111. |
[12] | HE Wangpeng;SUN Wei;SU Bo;YANG Yunyi;GUO Baolong. Sparse feature extraction technique and its applications to machinery fault diagnosis [J]. Journal of Xidian University, 2018, 45(2): 154-159. |
[13] | ZHANG Yuhe;GENG Guohua;WEI Xiaoran;SHI Chenchen;ZHANG Shunli. Feature extraction of point clouds using the DBSCAN clustering [J]. Journal of Xidian University, 2017, 44(2): 114-120. |
[14] | ZHANG Jujie;FANG Min;GUO Jin. New method for multi-label feature extraction [J]. Journal of Xidian University, 2016, 43(6): 62-67. |
[15] | KONG Xianguang;ZHANG Xiong;MA Hongbo;CHANG Jiantao;NIU Meng. Real time feature extraction method for complex industrial big data [J]. Journal of Xidian University, 2016, 43(5): 70-74+152. |
|