DocumentCode :
2782898
Title :
Audio classification based on sparse coefficients
Author :
Zubair, S. ; Wenwu Wang
Author_Institution :
Centre for Vision, Speech & Signal Process., Univ. of Surrey, Guildford, UK
fYear :
2011
fDate :
27-29 Sept. 2011
Firstpage :
1
Lastpage :
5
Abstract :
Audio signal classification is usually done using conventional signal features such as mel-frequency cepstrum coefficients (MFCC), line spectral frequencies (LSF), and short time energy (STM). Learned dictionaries have been shown to have promising capability for creating sparse representation of a signal and hence have a potential to be used for the extraction of signal features. In this paper, we consider to use sparse features for audio classification from music and speech data. We use the K-SVD algorithm to learn separate dictionaries for the speech and music signals to represent their respective subspaces and use them to extract sparse features for each class of signals using Orthogonal Matching Pursuit (OMP). Based on these sparse features, Support Vector Machines (SVM) are used for speech and music classification. The same signals were also classified using SVM based on the conventional MFCC coefficients and the classification results were compared to those of sparse coefficients. It was found that at lower signal to noise ratio (SNR), sparse coefficients give far better signal classification results as compared to the MFCC based classification.
Keywords :
audio signal processing; feature extraction; iterative methods; speech processing; support vector machines; time-frequency analysis; K-SVD algorithm; LSF; MFCC coefficients; OMP; SNR; STM; SVM; audio signal classification; line spectral frequencies; mel-frequency cepstrum coefficients; music classification; music data; orthogonal matching pursuit; short time energy; signal feature extraction; signal features; signal to noise ratio; sparse coefficients; sparse representation; speech classification; speech data; support vector machines;
fLanguage :
English
Publisher :
iet
Conference_Titel :
Sensor Signal Processing for Defence (SSPD 2011)
Conference_Location :
London
Electronic_ISBN :
978-1-84919-661-1
Type :
conf
DOI :
10.1049/ic.2011.0153
Filename :
6253409
Link To Document :
بازگشت