مرکز منطقه ای اطلاع رساني علوم و فناوري - Improvements in audio classification based on sinusoidal modeling

DocumentCode :

2695219

Title :

Improvements in audio classification based on sinusoidal modeling

Author :

Shirazi, Jalil ; Ghaemmaghami, Shahrokh ; Razzazi, Farbod

Author_Institution :

Islamic Azad Univ., Gonabad

fYear :

2008

fDate :

June 23 2008-April 26 2008

Firstpage :

1485

Lastpage :

1488

Abstract :

In this paper, a set of features is presented and evaluated based on sinusoidal modeling of audio signals. Amplitude, frequency, and phase parameters of the sinusoidal model are used and compared as input features into an audio classifier system. The performance of sinusoidal model features is evaluated for classification of audio into speech and music classes using both the Gaussian and the GMM (Gaussian mixture model) classifiers. Experimental results show superiority of the amplitude parameters of the sinusoidal model, which could be used for the first time for such an audio classification, as compared to the popular cepstral features. By using a set of 40 sinusoidal features, we achieved 95.06% accuracy in the audio classification at frame level, as compared to 92.26% accuracy obtained with the MFCC coefficients, as tested over the same audio corpus.

Keywords :

Gaussian processes; audio signal processing; cepstral analysis; signal classification; Gaussian mixture model; amplitude parameter; audio classification; audio signal; cepstral feature; phase parameter; sinusoidal modeling; Bit rate; Cepstral analysis; Mel frequency cepstral coefficient; Music information retrieval; Signal generators; Speech analysis; Speech recognition; Support vector machine classification; Support vector machines; Testing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Multimedia and Expo, 2008 IEEE International Conference on

Conference_Location :

Hannover

Print_ISBN :

978-1-4244-2570-9

Electronic_ISBN :

978-1-4244-2571-6

Type :

conf

DOI :

10.1109/ICME.2008.4607727

Filename :

4607727

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2695219