Author/Authors :
Peerapol Khunarsal، نويسنده , , Chidchanok Lursinsap، نويسنده , , Thanapant Raicharoen، نويسنده ,
Abstract :
Environmental sounds are unstructured and similar to noise. However, the recognition of environmental sounds can benefit crime investigations, warning systems for elderly persons, and security systems. A few past research projects were developed for classifying the environmental sounds. In this paper, we proposed an environmental sound classification algorithm using spectrogram pattern matching along with neural network and k-nearest neighbor (k-NN) classifiers. Unlike other techniques, our approach is based on the observation that local features are more important than global features. In addition, our technique can avoid the problem of filtering less informative and irrelevant frequencies in the classification step. Twenty types of sound from BBC and Sound Ideas databases, with each sound sample longer than 10 min, were tested with our algorithm. The spectrogram feature was compared with mel frequency delta cepstral coefficient (MFCC), linear prediction coefficient (LPC), and matching pursuit (MP) features. Two relevant factors concerning the accuracy of classification, window size and sampling rate, were also investigated to find the suitable value of each factor. We also investigated all combinations of these features. Using the k-NN classifier, the maximum accuracy of 94.98% occurred when the spectrogram, LPC, and MP features were combined.
Keywords :
Environmental sound recognition , spectrogram , Spectrogram pattern matching , k-Nearest neighbourneighbor (k-NN)