DocumentCode :
980537
Title :
Using One-Class SVMs and Wavelets for Audio Surveillance
Author :
Rabaoui, Asma ; Davy, Manuel ; Rossignol, Stéphane ; Ellouze, Noureddine
Author_Institution :
Unite de Rech. Signal, ENIT, Tunis
Volume :
3
Issue :
4
fYear :
2008
Firstpage :
763
Lastpage :
775
Abstract :
This paper presents a method aimed at recognizing environmental sounds for surveillance and security applications. We propose to apply one-class support vector machines (1-SVMs) together with a sophisticated dissimilarity measure in order to address audio classification, and more specifically, sound recognition. We illustrate the performance of this method on an audio database, which consists of 1015 sounds belonging to nine classes. The database used presents high intraclass diversity in temps of signal properties and some kind of interclass similarities. A large discrepancy in the number of items in each class implies nonuniform probability of sound appearances. The method proceeds as follows: first, the use of a set of state-of-the-art audio features is studied. Then, we introduce a set of novel features obtained by combining elementary features. Experiments conducted on a nine-class classification problem show the superiority of this novel sound recognition method. The best recognition accuracy (96.89%) is obtained when combining wavelet-based features, MFCCs, and individual temporal and frequency features. Our 1-SVM-based multiclass classification approach overperforms the conventional hidden Markov model-based system in the experiments conducted, the improvement in the error rate can reach 50%. Besides, we provide empirical results showing that the single-class SVM outperforms a combination of binary SVMs. Additional experiments demonstrate our method is robust to environmental noise.
Keywords :
audio signal processing; probability; security; signal classification; support vector machines; surveillance; wavelet transforms; MFCCs; audio classification; audio database; audio surveillance; environmental sounds recognition; nonuniform probability; one-class support vector machines; security applications; wavelets; Audio databases; Data security; Diversity reception; Error analysis; Frequency; Hidden Markov models; Spatial databases; Support vector machine classification; Support vector machines; Surveillance; Audio surveillance; feature combinations; one-class SVM (1-SVM); robust features; sound recognition; wavelets;
fLanguage :
English
Journal_Title :
Information Forensics and Security, IEEE Transactions on
Publisher :
ieee
ISSN :
1556-6013
Type :
jour
DOI :
10.1109/TIFS.2008.2008216
Filename :
4668357
Link To Document :
بازگشت