• DocumentCode
    3400999
  • Title

    A multi-class SVM based phonemes classifier based on a trainable confidence measure

  • Author

    Amini, Sahar ; Razzazi, Farbod ; Nayebi, Kambiz

  • Author_Institution
    Electr. Eng. Dept., Islamic Azad Univ., Tehran, Iran
  • fYear
    2009
  • fDate
    14-17 Dec. 2009
  • Firstpage
    49
  • Lastpage
    54
  • Abstract
    Although the recognition results of support vector machines (SVM) are very promising in many applications, there is a gap between the accuracy of SVM-based speech recognizers and that of time series models (e.g. hidden Markov models) in speech recognition. The main reasons are the lack of proper methods to classify the acoustic units into more than two classes and the lack of suitable SVM-based sequence decoders. This paper describes a trainable method for SVM multi-class classification that uses an artificial neural network to combine confidence measures from sets of two-class SVM classifiers. In addition, a pruning method is proposed for SVM multi-class classification to decrease the computational complexity without a significant decrease in accuracy. A method is also proposed for time series recognition of the feature vectors of an utterance based on SVM classifiers. The experiments were conducted on a set of confusable phonemes using the TIMIT corpus. The results of the first method show 10% and 6% relative improvements in the recognition rate in comparison to the one-versus-one and Kruger methods, respectively, for the /b/, /d/ and /g/ phonemes. In addition, it is empirically deduced that the proposed phoneme classification framework yields significantly better classification rates than the classic voting method. Comparing the phoneme classification results of the proposed method with those of the one-versus-one method indicates a 26% improvement in the classification rate for the /b/, /d/ and /g/ phonemes.
  • Keywords
    speech processing; speech recognition; support vector machines; time series; multi-class SVM based phonemes classifier; sequence decoders; speech recognizers; time series recognition; trainable confidence measure; Support vector machine classification; Automatic Speech Recognition; Confidence Measure; Multi-Class Support Vector Machines; Neural Network; Time Series Models
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing and Information Technology (ISSPIT), 2009 IEEE International Symposium on
  • Conference_Location
    Ajman
  • Print_ISBN
    978-1-4244-5949-0
  • Type

    conf

  • DOI
    10.1109/ISSPIT.2009.5407477
  • Filename
    5407477
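
The abstract compares the proposed method against a one-versus-one baseline with classic voting. As a rough illustration of that baseline only (not of the paper's trainable confidence measure, pruning method, or sequence decoder), the voting step might be sketched as follows. The function names, the `pairwise_decision` callback, and the one-dimensional toy prototypes are all hypothetical stand-ins for trained two-class SVMs; none of them come from the paper.

```python
from collections import Counter
from itertools import combinations


def one_vs_one_vote(x, classes, pairwise_decision):
    """Return the class with the most pairwise wins for sample x.

    pairwise_decision(a, b, x) is a hypothetical stand-in for a trained
    two-class SVM: a positive score votes for class a, otherwise for b.
    """
    votes = Counter({c: 0 for c in classes})
    for a, b in combinations(classes, 2):
        winner = a if pairwise_decision(a, b, x) > 0 else b
        votes[winner] += 1
    # Ties are broken by the order of `classes` (max keeps the first maximum).
    return max(classes, key=lambda c: votes[c])


# Toy stand-in for trained SVMs (an assumption for illustration only):
# each class has a 1-D prototype, and the pairwise score is the difference
# between the distances from x to the two prototypes.
PROTOTYPES = {"b": 0.0, "d": 1.0, "g": 2.0}


def toy_decision(a, b, x):
    return abs(x - PROTOTYPES[b]) - abs(x - PROTOTYPES[a])


predicted = one_vs_one_vote(0.9, ["b", "d", "g"], toy_decision)
print(predicted)
```

With three classes the scheme evaluates three pairwise classifiers per sample. Plain voting discards the magnitude of each pairwise score, which is the information a confidence-based combiner such as the one described in the abstract would presumably try to exploit.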