Title :
A New SVM-based Mix Audio Classification
Author :
Mahale, P.M.B. ; Rashidi, Mahsa ; Faez, Karim ; Sayadiyan, Abolghasem
Author_Institution :
Amirkabir Univ. of Technol., Tehran
Abstract :
A preprocessing stage in every speech/music applications including separation, recognition and transcription task is inevitable to determine each frame belongs to which classes, namely: speech only, music only or speech/music mixture. Such classification can significantly decrease the computational burden due to exhaustive search commonly introduced as a problem in model-based speech recognition or separation as well as music transcription scenarios. In this paper, we present a new method to separate mixed type audio frames based on support vector machine (SVM). The challenging problem in this work is seeking the most appropriate features to discriminate these classes. As a result, we propose some novel features based on eigen- decomposition which presents acceptable classification result. The experimental results show that the proposed system outperforms other classification systems including k nearest neighbor (k-NN), multi-layer perceptron (MLP).
Keywords :
audio signals; eigenvalues and eigenfunctions; signal classification; support vector machines; eigendecomposition; mix audio classification; mixed type audio frames; support vector machine; Acoustic noise; Cepstrum; Mel frequency cepstral coefficient; Multilayer perceptrons; Music information retrieval; Nearest neighbor searches; Speech analysis; Speech recognition; Support vector machine classification; Support vector machines; Eigen ratio; KNN; MLP; RBF; SVM;
Conference_Titel :
System Theory, 2008. SSST 2008. 40th Southeastern Symposium on
Conference_Location :
New Orleans, LA
Print_ISBN :
978-1-4244-1806-0
Electronic_ISBN :
0094-2898
DOI :
10.1109/SSST.2008.4480219