Title :
Improved morphological decomposition for Arabic broadcast news transcription
Author :
Ng, Tim ; Nguyen, Kham ; Zbib, Rabih ; Nguyen, Long
Author_Institution :
BBN Technol., Cambridge, MA
Abstract :
In this paper, we show the progress for Arabic speech recognition by incorporating contextual information into the process of morphological decomposition. The new approach achieves lower out-of-vocabulary and word error rates when compared to our previous work, in which the morphological decomposition relies on word-level information only. We also describe how the vocalization procedure is improved to produce pronunciations for some dialect Arabic words. By using the new approach, we reduced the word error by 0.8% absolute (4.7% relative) when compared to the baseline approach.
Keywords :
speech processing; speech recognition; Arabic broadcast news transcription; Arabic speech recognition; contextual information; dialect Arabic words; morphological decomposition; out-of-vocabulary; pronunciations; vocalization procedure; word error rates; word-level information; Arabic; Speech recognition; morphological decomposition;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960582