• DocumentCode
    621977
  • Title

    Hybrid SVM/HMM model for the recognition of Arabic triphones-based continuous speech

  • Author

    Zarrouk, Elyes ; Benayed, Yassine ; Gargouri, Faiez

  • Author_Institution
    MIRACL, Multimedia Inf. Syst. & Adv. Comput. Lab., Univ. of Sfax, Sfax, Tunisia
  • fYear
    2013
  • fDate
    18-21 March 2013
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    Even if the progress of Hidden Markov Models (HMM) is huge, those models lack a discriminatory ability especially on speech recognition. In order to ameliorate the results of recognition systems, we apply Support Vectors Machine (SVM) as an estimator of posterior probabilities since they are characterized by a high predictive power and discrimination. Moreover, they are based on a structural risk minimization (SRM) where the aim is to set up a classifier that minimizes a bound on the expected risk, rather than the empirical risk. In this paper, we describe the use of the hybrid model SVM/HMM for Arabic triphones-based continuous speech. Furthermore, our work incorporates the stage of preparing language models. It consists in a novel approach for automatic labeling with respect to syntax and grammar rules of the Arabic language. The best results are obtained with the proposed system SVM/HMM when we achieve 76.96% as the best recognition rate of a tested speaker. The speech recognizer was evaluated with ARABIC_DB corpus and performs at 11.42% WER as compared to 13.32% with triphones mixture-Gaussian HMM system.
  • Keywords
    grammars; hidden Markov models; minimisation; natural language processing; pattern classification; probability; risk analysis; speech recognition; support vector machines; ARABIC_DB corpus; Arabic language; Arabic triphones-based continuous speech recognition; SRM; automatic labeling; classifier; expected risk bound minimization; grammar rules; hidden Markov models; hybrid SVM-HMM model; posterior probability estimator; predictive power; structural risk minimization; support vector machine; syntax; Acoustics; Hidden Markov models; Kernel; Labeling; Speech; Speech recognition; Support vector machines; Automatic speech recognition; Support Vector machine; automatic labeling; hybrid system; triphones-based continuous speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Signals & Devices (SSD), 2013 10th International Multi-Conference on
  • Conference_Location
    Hammamet
  • Print_ISBN
    978-1-4673-6459-1
  • Electronic_ISBN
    978-1-4673-6458-4
  • Type

    conf

  • DOI
    10.1109/SSD.2013.6564036
  • Filename
    6564036