DocumentCode :
2430067
Title :
Comparative experiments to evaluate the use of syllables for the automatic recognition of arabic spoken names in noisy environment
Author :
Azmi, Mohamed M. ; Tolba, Hesham
Author_Institution :
Alexandria Higher Inst. of Eng., Alexandria
fYear :
2008
fDate :
7-11 June 2008
Firstpage :
440
Lastpage :
444
Abstract :
This paper addresses the problem of noise robustness of automatic speech recognition (ASR) systems, using a hybrid technique: a speech pre-processing enhancement technique and the use of the syllables as the acoustic units for the ASR process. The speech pre-processing enhancement technique was accomplished by the use of the Ephraim-Malah filter. We tested our system using a database that consists of spoken Arabic names in noisy environments. This is achieved by the use of an HMM-based statistical recognition engine. Comparative experiments show that the syllable-based recognition outperforms the monophone- and triphone-based recognition in noisy environments. The HTK hidden Markov model toolkit was used throughout our experiments. Results show that the recognition rate obtained in noisy environments using syllables, outperformed the rates obtained using both triphones and monophones by 5.79% and 39.72%, respectively. On the other hand, with the integration of the Ephraim-Malah filter in the front-end of our syllable-based ASR system, we show through experiments that, the recognition rate using syllables outperformed the rate obtained using triphones and monophones by 6.58% and 39.72%, respectively.
Keywords :
filtering theory; hidden Markov models; natural language processing; speech enhancement; speech recognition; statistical analysis; Arabic spoken name; Ephraim-Malah filter; automatic speech recognition; hidden Markov model; noisy environment; speech preprocessing enhancement; statistical recognition; syllable-based recognition; Acoustic noise; Acoustic testing; Automatic speech recognition; Filters; Hidden Markov models; Noise robustness; Speech enhancement; Speech processing; System testing; Working environment noise; Arabic Language; Robust ASR; Syllables;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks and Signal Processing, 2008 International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4244-2310-1
Electronic_ISBN :
978-1-4244-2311-8
Type :
conf
DOI :
10.1109/ICNNSP.2008.4590389
Filename :
4590389
Link To Document :
بازگشت