Research and Application of LSTM and GRU in Urban Sound Classification

SUN Chenying; SHEN Xizhong

doi:10.3969/j.issn.2096-3424.2020.02.008

SUN Chenying, SHEN Xizhong. Research and Application of LSTM and GRU in Urban Sound Classification[J]. Journal of Technology, 2020, 20(2): 158-164. DOI: 10.3969/j.issn.2096-3424.2020.02.008

Abstract

Different types of sound affect the physical and mental health of urban residents in different ways. Accurate classification of urban sounds supports their effective evaluation and, in turn, the management of the urban sound environment. Deep learning has been widely applied to speech recognition, where the recurrent neural network (RNN) is the most prominent model. Because the basic RNN suffers from pronounced vanishing gradients, high loss, and low accuracy, improved recurrent neural networks were employed to classify urban background noise. Long short-term memory (LSTM) and gated recurrent unit (GRU) networks were used to construct a deep recurrent neural network model. The accuracy of the constructed network was tested and analyzed on the public data set UrbanSound8K. With Mel-frequency cepstral coefficients (MFCCs) as the baseline features, the model achieved significantly better results than the basic RNN.
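
To make the gated recurrence mentioned in the abstract concrete, the following is a minimal sketch of a single-layer GRU classifier run over a sequence of MFCC-like frames. It is not the authors' model: the class name `GRUClassifier`, the 13 coefficients per frame, the 40 frames, the 16 hidden units, and the omission of bias terms are all assumptions made for illustration, the weights are random and untrained, and the input is random noise rather than audio. Only the 10 output classes follow from UrbanSound8K itself.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def rand_matrix(rows, cols, rng, scale=0.1):
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


class GRUClassifier:
    """Hypothetical single-layer GRU followed by a softmax layer (biases omitted)."""

    def __init__(self, n_features, n_hidden, n_classes, seed=0):
        rng = random.Random(seed)
        self.n_hidden = n_hidden
        # W*: input-to-hidden weights, U*: hidden-to-hidden weights
        self.Wz = rand_matrix(n_hidden, n_features, rng)  # update gate
        self.Uz = rand_matrix(n_hidden, n_hidden, rng)
        self.Wr = rand_matrix(n_hidden, n_features, rng)  # reset gate
        self.Ur = rand_matrix(n_hidden, n_hidden, rng)
        self.Wh = rand_matrix(n_hidden, n_features, rng)  # candidate state
        self.Uh = rand_matrix(n_hidden, n_hidden, rng)
        self.Wo = rand_matrix(n_classes, n_hidden, rng)   # output layer

    def step(self, x, h):
        """Advance the hidden state h by one input frame x."""
        z = [sigmoid(a + b) for a, b in zip(matvec(self.Wz, x), matvec(self.Uz, h))]
        r = [sigmoid(a + b) for a, b in zip(matvec(self.Wr, x), matvec(self.Ur, h))]
        gated_h = [ri * hi for ri, hi in zip(r, h)]
        cand = [math.tanh(a + b)
                for a, b in zip(matvec(self.Wh, x), matvec(self.Uh, gated_h))]
        # The update gate interpolates between the old state and the candidate.
        # (Some references swap the roles of z and 1 - z; both conventions are in use.)
        return [(1.0 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h, cand)]

    def predict_proba(self, frames):
        """Run the GRU over all frames and return softmax class probabilities."""
        h = [0.0] * self.n_hidden
        for x in frames:
            h = self.step(x, h)
        logits = matvec(self.Wo, h)
        top = max(logits)
        exps = [math.exp(v - top) for v in logits]  # subtract max for stability
        total = sum(exps)
        return [e / total for e in exps]


# Random stand-in for 40 frames of 13 MFCCs each (not real audio).
rng = random.Random(1)
frames = [[rng.gauss(0.0, 1.0) for _ in range(13)] for _ in range(40)]

model = GRUClassifier(n_features=13, n_hidden=16, n_classes=10)
probs = model.predict_proba(frames)
print(len(probs), round(sum(probs), 6))
```

When the update gate stays near zero, the hidden state is carried forward almost unchanged, which is the mechanism usually credited with easing the vanishing-gradient problem of the basic RNN. A practical system would compute real MFCCs from the audio clips and train the weights with a deep learning framework rather than use this hand-written forward pass.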