• DocumentCode
    3199162
  • Title

    A New SVM-based Mix Audio Classification

  • Author

    Mahale, P.M.B. ; Rashidi, Mahsa ; Faez, Karim ; Sayadiyan, Abolghasem

  • Author_Institution
    Amirkabir Univ. of Technol., Tehran
  • fYear
    2008
  • fDate
    16-18 March 2008
  • Firstpage
    198
  • Lastpage
    202
  • Abstract
    A preprocessing stage in every speech/music applications including separation, recognition and transcription task is inevitable to determine each frame belongs to which classes, namely: speech only, music only or speech/music mixture. Such classification can significantly decrease the computational burden due to exhaustive search commonly introduced as a problem in model-based speech recognition or separation as well as music transcription scenarios. In this paper, we present a new method to separate mixed type audio frames based on support vector machine (SVM). The challenging problem in this work is seeking the most appropriate features to discriminate these classes. As a result, we propose some novel features based on eigen- decomposition which presents acceptable classification result. The experimental results show that the proposed system outperforms other classification systems including k nearest neighbor (k-NN), multi-layer perceptron (MLP).
  • Keywords
    audio signals; eigenvalues and eigenfunctions; signal classification; support vector machines; eigendecomposition; mix audio classification; mixed type audio frames; support vector machine; Acoustic noise; Cepstrum; Mel frequency cepstral coefficient; Multilayer perceptrons; Music information retrieval; Nearest neighbor searches; Speech analysis; Speech recognition; Support vector machine classification; Support vector machines; Eigen ratio; KNN; MLP; RBF; SVM;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    System Theory, 2008. SSST 2008. 40th Southeastern Symposium on
  • Conference_Location
    New Orleans, LA
  • ISSN
    0094-2898
  • Print_ISBN
    978-1-4244-1806-0
  • Electronic_ISBN
    0094-2898
  • Type

    conf

  • DOI
    10.1109/SSST.2008.4480219
  • Filename
    4480219