• DocumentCode
    2426563
  • Title

    Syllable-based automatic arabic speech recognition in noisy enviroment

  • Author

    Azmi, Mohamed M. ; Tolba, Hesham

  • Author_Institution
    Alexandria Higher Inst. of Eng. & Technol., Alexandria
  • fYear
    2008
  • fDate
    7-9 July 2008
  • Firstpage
    1436
  • Lastpage
    1441
  • Abstract
    In this paper, syllables are proposed to be used as acoustic units to improve the performance of automatic speech recognition (ASR) systems of Arabic spoken proverbs in noisy environments. To test our proposed approach, a speaker-independent HMM-based speech recognition system was designed using hidden Markov model toolkit (HTK). A series of experiments on noisy speech has been carried out using an Arabic database that consists of fifty-nine Egyptian speakers. The obtained results show that the recognition rate using syllables outperformed the rate obtained using monophones and triphones by 20.88% and 15.82%, respectively. The use of syllables did not only improve the performance of the ASR process in noisy environments, but also it limited the complexity of the computation (and consequently the running time) of the recognition process. Also, we show in this paper that the integration of a pre-processing enhancement technique in the front-end of the syllable-based ASR engine leads to an improvement of the recognition rate by 20.88% and 15.82%, compared to the rates obtained using monophones and triphone-based ASR, respectively.
  • Keywords
    hidden Markov models; speech enhancement; speech recognition; Arabic spoken proverbs; HMM-based speech recognition; hidden Markov model toolkit; noisy environment; pre-processing enhancement technique; speaker-independent speech recognition; syllable-based automatic Arabic speech recognition; triphone-based ASR; Acoustic noise; Acoustical engineering; Automatic speech recognition; Dictionaries; Engines; Hidden Markov models; Noise robustness; Pattern matching; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Audio, Language and Image Processing, 2008. ICALIP 2008. International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4244-1723-0
  • Electronic_ISBN
    978-1-4244-1724-7
  • Type

    conf

  • DOI
    10.1109/ICALIP.2008.4590209
  • Filename
    4590209