Title :
Investigations on the use of morpheme level features in Language Models for Arabic LVCSR
Author :
Mousa, Amr El-Desoky ; Schlüter, Ralf ; Ney, Hermann
Author_Institution :
Comput. Sci. Dept., RWTH Aachen Univ., Aachen, Germany
Abstract :
A major challenge for Arabic Large Vocabulary Continuous Speech Recognition (LVCSR) is the rich morphology of Arabic, which leads to high Out-of-vocabulary (OOV) rates, and poor Language Model (LM) probabilities. In such cases, the use of morphemes rather than full-words is considered a better choice for LMs. Thereby, higher lexical coverage and less LM perplexities are achieved. On the other side, an effective way to increase the robustness of LMs is to incorporate features of words into LMs. In this paper, we investigate the use of features derived for morphemes rather than words. Thus, we combine the benefits of both morpheme level and feature rich modeling. We compare the performance of stream-based, class-based and Factored LMs (FLMs) estimated over sequences of morphemes and their features for performing Arabic LVCSR. A relative reduction of 3.9% in Word Error Rate (WER) is achieved compared to a word-based system.
Keywords :
speech recognition; vocabulary; word processing; FLM; LM probability; OOV rate; WER; arabic LVCSR; arabic large vocabulary continuous speech recognition; class-based LM; factored LM; high out-of-vocabulary rate; higher lexical coverage; language model probability; language models; less LM perplexity; morpheme level features; rich morphology; stream-based LM; word error rate; word-based system; Computational modeling; Humans; Interpolation; Lattices; Mathematical model; Speech recognition; USA Councils; class-based; factored; language model; morpheme; stream-based;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6289048