DocumentCode :
573188
Title :
Discriminative sparse-based feature extraction and dictionary learning for sound classification applications
Author :
Seyedin, Sanaz ; Pichevar, Ramin ; Rouat, Jean
Author_Institution :
Dept. de Genie Electr. et de Genie Inf., Univ. de Sherbrooke, Sherbrooke, QC, Canada
fYear :
2012
fDate :
2-5 July 2012
Firstpage :
1330
Lastpage :
1335
Abstract :
This paper presents a novel sparse-based classification algorithm for audio applications such as sound classification. We propose performing the sparse feature extraction, the dictionary learning, and classification processes simultaneously. This discriminative learning procedure for adapting the dictionaries and classifier to each specified audio task, instead of employing pre-defined dictionaries is the main novelty of our work. According to our experiments, applying this algorithm on some Mel-scale spectral features, such as MFCC (Mel Frequency Cepstral Coefficient), instead of raw temporal data can improve the accuracy and execution time significantly. Our proposed discriminative MFCC-sparse features when evaluated on real data consisting of five audio classes, substantially out-performed the non-discriminative ones. The lengths of test segments in our method are less than 0.5 second. This potential of usage for real-time applications is another advantage of our proposed approach.
Keywords :
audio signal processing; dictionaries; feature extraction; learning (artificial intelligence); signal classification; Mel frequency cepstral coefficient; Mel-scale spectral features; audio applications; classification processes; dictionaries; dictionary learning; discriminative MFCC-sparse features; discriminative learning; discriminative sparse-based feature extraction; execution time; sound classification; sparse feature extraction; sparse-based classification; temporal data; Accuracy; Dictionaries; Feature extraction; Harmonic analysis; Mel frequency cepstral coefficient; Speech; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
Conference_Location :
Montreal, QC
Print_ISBN :
978-1-4673-0381-1
Electronic_ISBN :
978-1-4673-0380-4
Type :
conf
DOI :
10.1109/ISSPA.2012.6310500
Filename :
6310500
Link To Document :
بازگشت