• DocumentCode
    2430067
  • Title

    Comparative experiments to evaluate the use of syllables for the automatic recognition of Arabic spoken names in noisy environment

  • Author

    Azmi, Mohamed M. ; Tolba, Hesham

  • Author_Institution
    Alexandria Higher Inst. of Eng., Alexandria
  • fYear
    2008
  • fDate
    7-11 June 2008
  • Firstpage
    440
  • Lastpage
    444
  • Abstract
    This paper addresses the noise robustness of automatic speech recognition (ASR) systems using a hybrid technique: speech enhancement as a pre-processing step, combined with syllables as the acoustic units for recognition. The enhancement step uses the Ephraim-Malah filter. We tested our system, which is built on an HMM-based statistical recognition engine, on a database of spoken Arabic names in noisy environments. The Hidden Markov Model Toolkit (HTK) was used throughout our experiments. Comparative experiments show that syllable-based recognition outperforms monophone- and triphone-based recognition in noisy environments: the recognition rate obtained using syllables exceeded the rates obtained using triphones and monophones by 5.79% and 39.72%, respectively. With the Ephraim-Malah filter integrated in the front-end of our syllable-based ASR system, the recognition rate using syllables exceeded the rates obtained using triphones and monophones by 6.58% and 39.72%, respectively.
  • Keywords
    filtering theory; hidden Markov models; natural language processing; speech enhancement; speech recognition; statistical analysis; Arabic spoken name; Ephraim-Malah filter; automatic speech recognition; hidden Markov model; noisy environment; speech preprocessing enhancement; statistical recognition; syllable-based recognition; Acoustic noise; Acoustic testing; Automatic speech recognition; Filters; Hidden Markov models; Noise robustness; Speech enhancement; Speech processing; System testing; Working environment noise; Arabic Language; Robust ASR; Syllables;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Neural Networks and Signal Processing, 2008 International Conference on
  • Conference_Location
    Nanjing
  • Print_ISBN
    978-1-4244-2310-1
  • Electronic_ISBN
    978-1-4244-2311-8
  • Type

    conf

  • DOI
    10.1109/ICNNSP.2008.4590389
  • Filename
    4590389