Journal of Xidian University ›› 2019, Vol. 46 ›› Issue (6): 9-16. doi: 10.19665/j.issn1001-2400.2019.06.002


Urban sound event classification with the N-order dense convolutional network

CAO Yi1,2, HUANG Zilong1,2, ZHANG Wei1,2, LIU Chen1,2, LI Wei3

  1. School of Mechanical Engineering, Jiangnan University, Wuxi 214122, China
  2. Jiangsu Key Laboratory of Advanced Food Manufacturing Equipment and Technology, Wuxi 214122, China
  3. Suzhou Institute of Industrial Technology, Suzhou 215104, China
  • Received: 2019-05-15 Online: 2019-12-20 Published: 2019-12-21

Abstract:

To address the insufficient classification accuracy and generalization ability of existing models, an urban sound event classification model based on the N-order Dense Convolutional Network (N-DenseNet) is proposed. First, the network structure of the DenseNet is briefly introduced. Then, the dense connection in the DenseNet is improved by means of an N-order state-dependent connection based on the N-order Markov model. Combining the advantages of the DenseNet and the N-order Markov model yields a novel network architecture, the N-DenseNet. In theory, while still alleviating the vanishing-gradient problem, the N-DenseNet can both integrate feature information from the layers efficiently and accelerate convergence. Finally, to validate the advantages of the new model, the 1-DenseNet and the 2-DenseNet are each applied to urban sound event classification on the UrbanSound8K and DCASE2016 datasets. Experimental results show that the two models achieve accuracies of 83.63% and 81.03%, respectively, which demonstrates the higher classification accuracy and better generalization performance of the N-DenseNet.
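
The abstract does not spell out the exact wiring of the N-order state-dependent connection. As a minimal sketch, assuming that "N-order" means each layer receives only the feature maps of the N most recent predecessors (by analogy with an N-order Markov model), the connectivity can be contrasted with the standard dense connection as follows. The function names, the 1-indexed layer numbering with index 0 for the block input, and the channel counts `k0` and `k` are illustrative assumptions, not taken from the paper.

```python
from typing import List


def dense_inputs(layer: int) -> List[int]:
    """Standard DenseNet: layer `layer` (1-indexed) receives the block
    input (index 0) and the outputs of all earlier layers."""
    return list(range(0, layer))


def n_order_inputs(layer: int, n: int) -> List[int]:
    """ASSUMED N-order connection: layer `layer` receives only the n most
    recent feature maps, i.e. indices layer-n .. layer-1 (clipped at 0)."""
    if n < 1:
        raise ValueError("order n must be at least 1")
    return list(range(max(0, layer - n), layer))


def input_channels(sources: List[int], k0: int, k: int) -> int:
    """Channels after concatenating the sources, where the block input
    has k0 channels and every layer emits k channels (growth rate)."""
    return sum(k0 if s == 0 else k for s in sources)


if __name__ == "__main__":
    # Hypothetical sizes: 16 input channels, growth rate 12.
    for name, srcs in [("dense", dense_inputs(5)),
                       ("2-order", n_order_inputs(5, 2))]:
        print(name, srcs, input_channels(srcs, k0=16, k=12))
```

Under this assumption, the input width of a layer stays bounded by the order N instead of growing with depth, which is one plausible reading of the claim that feature information is integrated efficiently.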

Key words: sound event classification, dense convolutional network, N-order Markov model, N-order dense convolutional network

CLC Number: 

  • TP391.42