• DocumentCode
    2895879
  • Title

    Is Phoneme Level Better than Word Level for HMM Models in Limited Vocabulary ASR Systems?

  • Author

    Alotaibi, Yousef Ajami

  • Author_Institution
    Comput. Eng. Dept., King Saud Univ., Riyadh, Saudi Arabia
  • fYear
    2010
  • fDate
    12-14 April 2010
  • Firstpage
    332
  • Lastpage
    337
  • Abstract
    In this paper Arabic alphadigits were investigated from the speech recognition problem point of view. Limited vocabulary Arabic Automatic Speech Recognition Systems (ASRs) were designed, implemented, and tested by using isolated word utterances which consists of Arabic alphabets and/or digits. These systems were implemented separately by using phoneme level and word level based HMM models in distinct systems. The systems were isolated word speech recognizers with a database with size of 16,200 tokens in total that was created depending on 67 Arabic native speakers. Experiment were designed and investigated with different HMM models and different levels of viewing the sound units. Phoneme level and word level models were compared along with Arabic digits alone, alphabets alone, and, then, with both digits and alphabet. In all cases the whole system accuracies and individual utterance accuracies were generated and compared. This research showed that the phoneme level HMM models are superior for limited vocabulary ASR.
  • Keywords
    hidden Markov models; natural language processing; speech recognition; Arabic digits; HMM models; automatic speech recognition; hidden Markov model; isolated word utterances; limited vocabulary ASR systems; phoneme level; word level; Automatic speech recognition; Educational institutions; Hidden Markov models; Information technology; Natural languages; Shape; Speech recognition; Testing; Vocabulary; Writing; Alphadigits; Arabic; HMM; Recognition; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology: New Generations (ITNG), 2010 Seventh International Conference on
  • Conference_Location
    Las Vegas, NV
  • Print_ISBN
    978-1-4244-6270-4
  • Type

    conf

  • DOI
    10.1109/ITNG.2010.73
  • Filename
    5501707