Title :
Audio classification based on sinusoidal model: A new feature
Author :
Shirazi, Jalil ; Ghaemmaghami, Shahrokh
Author_Institution :
Sci. & Res. Branch, Islamic Azad Univ., Tehran
Abstract :
In this paper, a new feature set is presented and evaluated based on sinusoidal modeling of audio signals. Duration of the longest sinusoidal model frequency track, as a measure of the harmony, is used and compared to typical features as input into an audio classifier. The performance of this sinusoidal model feature is evaluated through classification of audio to speech and music using both the GMM and the SVM classifiers. Classification results show the proposed feature, which could be used for the first time in such an audio classification, is quite successful in speech/music classification. Experimental comparisons with popular features for audio classification, such as HZCRR and LSTER, are presented and discussed. By using a set of three features, extracted from 1-second segments of the signal, we achieved 94.32% accuracy in the audio classification.
Keywords :
Gaussian processes; audio signal processing; feature extraction; music; signal classification; speech processing; support vector machines; GMM; SVM classifier; audio classification; audio signal processing; feature extraction; music classification; sinusoidal model; speech classification; Data mining; Feature extraction; Frequency measurement; Multimedia systems; Music information retrieval; Speech analysis; Speech recognition; Support vector machine classification; Support vector machines; Time frequency analysis;
Conference_Titel :
TENCON 2008 - 2008 IEEE Region 10 Conference
Conference_Location :
Hyderabad
Print_ISBN :
978-1-4244-2408-5
Electronic_ISBN :
978-1-4244-2409-2
DOI :
10.1109/TENCON.2008.4766393