DocumentCode :
2524413
Title :
Audio classification based on sinusoidal model: A new feature
Author :
Shirazi, Jalil ; Ghaemmaghami, Shahrokh
Author_Institution :
Sci. & Res. Branch, Islamic Azad Univ., Tehran
fYear :
2008
fDate :
19-21 Nov. 2008
Firstpage :
1
Lastpage :
5
Abstract :
In this paper, a new feature set is presented and evaluated based on sinusoidal modeling of audio signals. Duration of the longest sinusoidal model frequency track, as a measure of the harmony, is used and compared to typical features as input into an audio classifier. The performance of this sinusoidal model feature is evaluated through classification of audio to speech and music using both the GMM and the SVM classifiers. Classification results show the proposed feature, which could be used for the first time in such an audio classification, is quite successful in speech/music classification. Experimental comparisons with popular features for audio classification, such as HZCRR and LSTER, are presented and discussed. By using a set of three features, extracted from 1-second segments of the signal, we achieved 94.32% accuracy in the audio classification.
Keywords :
Gaussian processes; audio signal processing; feature extraction; music; signal classification; speech processing; support vector machines; GMM; SVM classifier; audio classification; audio signal processing; feature extraction; music classification; sinusoidal model; speech classification; Data mining; Feature extraction; Frequency measurement; Multimedia systems; Music information retrieval; Speech analysis; Speech recognition; Support vector machine classification; Support vector machines; Time frequency analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
TENCON 2008 - 2008 IEEE Region 10 Conference
Conference_Location :
Hyderabad
Print_ISBN :
978-1-4244-2408-5
Electronic_ISBN :
978-1-4244-2409-2
Type :
conf
DOI :
10.1109/TENCON.2008.4766393
Filename :
4766393
Link To Document :
بازگشت