DocumentCode :
2695219
Title :
Improvements in audio classification based on sinusoidal modeling
Author :
Shirazi, Jalil ; Ghaemmaghami, Shahrokh ; Razzazi, Farbod
Author_Institution :
Islamic Azad Univ., Gonabad
fYear :
2008
fDate :
June 23 2008-April 26 2008
Firstpage :
1485
Lastpage :
1488
Abstract :
In this paper, a set of features is presented and evaluated based on sinusoidal modeling of audio signals. Amplitude, frequency, and phase parameters of the sinusoidal model are used and compared as input features into an audio classifier system. The performance of sinusoidal model features is evaluated for classification of audio into speech and music classes using both the Gaussian and the GMM (Gaussian mixture model) classifiers. Experimental results show superiority of the amplitude parameters of the sinusoidal model, which could be used for the first time for such an audio classification, as compared to the popular cepstral features. By using a set of 40 sinusoidal features, we achieved 95.06% accuracy in the audio classification at frame level, as compared to 92.26% accuracy obtained with the MFCC coefficients, as tested over the same audio corpus.
Keywords :
Gaussian processes; audio signal processing; cepstral analysis; signal classification; Gaussian mixture model; amplitude parameter; audio classification; audio signal; cepstral feature; phase parameter; sinusoidal modeling; Bit rate; Cepstral analysis; Mel frequency cepstral coefficient; Music information retrieval; Signal generators; Speech analysis; Speech recognition; Support vector machine classification; Support vector machines; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2008 IEEE International Conference on
Conference_Location :
Hannover
Print_ISBN :
978-1-4244-2570-9
Electronic_ISBN :
978-1-4244-2571-6
Type :
conf
DOI :
10.1109/ICME.2008.4607727
Filename :
4607727
Link To Document :
بازگشت