مرکز منطقه ای اطلاع رساني علوم و فناوري - Audio classification based on sinusoidal model: A new feature

DocumentCode :

2524413

Title :

Audio classification based on sinusoidal model: A new feature

Author :

Shirazi, Jalil ; Ghaemmaghami, Shahrokh

Author_Institution :

Sci. & Res. Branch, Islamic Azad Univ., Tehran

fYear :

2008

fDate :

19-21 Nov. 2008

Firstpage :

Lastpage :

Abstract :

In this paper, a new feature set is presented and evaluated based on sinusoidal modeling of audio signals. Duration of the longest sinusoidal model frequency track, as a measure of the harmony, is used and compared to typical features as input into an audio classifier. The performance of this sinusoidal model feature is evaluated through classification of audio to speech and music using both the GMM and the SVM classifiers. Classification results show the proposed feature, which could be used for the first time in such an audio classification, is quite successful in speech/music classification. Experimental comparisons with popular features for audio classification, such as HZCRR and LSTER, are presented and discussed. By using a set of three features, extracted from 1-second segments of the signal, we achieved 94.32% accuracy in the audio classification.

Keywords :

Gaussian processes; audio signal processing; feature extraction; music; signal classification; speech processing; support vector machines; GMM; SVM classifier; audio classification; audio signal processing; feature extraction; music classification; sinusoidal model; speech classification; Data mining; Feature extraction; Frequency measurement; Multimedia systems; Music information retrieval; Speech analysis; Speech recognition; Support vector machine classification; Support vector machines; Time frequency analysis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

TENCON 2008 - 2008 IEEE Region 10 Conference

Conference_Location :

Hyderabad

Print_ISBN :

978-1-4244-2408-5

Electronic_ISBN :

978-1-4244-2409-2

Type :

conf

DOI :

10.1109/TENCON.2008.4766393

Filename :

4766393

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2524413