Title : 
Improvements in audio classification based on sinusoidal modeling
         
        
            Author : 
Shirazi, Jalil ; Ghaemmaghami, Shahrokh ; Razzazi, Farbod
         
        
            Author_Institution : 
Islamic Azad Univ., Gonabad
         
        
        
            fDate : 
June 23 2008-April 26 2008
         
        
        
        
            Abstract : 
In this paper, a set of features is presented and evaluated based on sinusoidal modeling of audio signals. Amplitude, frequency, and phase parameters of the sinusoidal model are used and compared as input features into an audio classifier system. The performance of sinusoidal model features is evaluated for classification of audio into speech and music classes using both the Gaussian and the GMM (Gaussian mixture model) classifiers. Experimental results show superiority of the amplitude parameters of the sinusoidal model, which could be used for the first time for such an audio classification, as compared to the popular cepstral features. By using a set of 40 sinusoidal features, we achieved 95.06% accuracy in the audio classification at frame level, as compared to 92.26% accuracy obtained with the MFCC coefficients, as tested over the same audio corpus.
         
        
            Keywords : 
Gaussian processes; audio signal processing; cepstral analysis; signal classification; Gaussian mixture model; amplitude parameter; audio classification; audio signal; cepstral feature; phase parameter; sinusoidal modeling; Bit rate; Cepstral analysis; Mel frequency cepstral coefficient; Music information retrieval; Signal generators; Speech analysis; Speech recognition; Support vector machine classification; Support vector machines; Testing;
         
        
        
        
            Conference_Titel : 
Multimedia and Expo, 2008 IEEE International Conference on
         
        
            Conference_Location : 
Hannover
         
        
            Print_ISBN : 
978-1-4244-2570-9
         
        
            Electronic_ISBN : 
978-1-4244-2571-6
         
        
        
            DOI : 
10.1109/ICME.2008.4607727