Title :
Environmental sounds recognition using TESPAR
Author :
You, Guanyu ; Li, Ying
Author_Institution :
College of Mathematics and Computer Science, Fuzhou University, China
Abstract :
Environmental sounds capture the survival and activities of many kinds of living creatures and are closely tied to the human living environment. Conventional approaches to environmental sound recognition require substantial computational resources and employ complex signal processing methods in the frequency domain. This work proposes a low-complexity method, Time Encoded Signal Processing and Recognition (TESPAR). Its computational requirements are two orders of magnitude lower than those of other common methods. We use TESPAR coding to produce simple data structures and then use the archetypes technique for classification. The method was tested on two databases: database 1 consists of 10 classes of bird sounds, to test interspecific recognition, and database 2 consists of 10 classes of different environmental sounds, to test intraspecific recognition. We also ran experiments on the same databases using MFCC and SVM for comparison. The results show that TESPAR has lower training time complexity than SVM, and that the recognition rate for intraspecific recognition was better than for interspecific recognition.
Keywords :
Linde-Buzo-Gray vector quantization; Time Encoded Signal Processing And Recognition; archetypes; environmental sounds recognition; interspecific recognition; intraspecific recognition;
Conference_Title :
Image and Signal Processing (CISP), 2012 5th International Congress on
Conference_Location :
Chongqing, Sichuan, China
Print_ISBN :
978-1-4673-0965-3
DOI :
10.1109/CISP.2012.6469781