Title :
Syllabic Markov models of Arabic HMMs of spoken Arabic using CV units
Author :
Ingleby, Michael ; Baothman, Fatmah
Author_Institution :
Inf. Syst. Dept., King Abdulaziz Univ., Jeddah, Saudi Arabia
Abstract :
We survey evidence - orthographic distributional phonological and psycholinguistic - in favor of a model of Arabic speech sounds based on the CV unit and extensive use of the silent sukuun vowel. We then construct a small-vocabulary multi-speaker CV HMM similar to the phonemic HMMs based on tied triphones that are widely used in speech recognizers for English and other European languages. Using experimental measures of recognition accuracy and trainability, we demonstrate that the CV type of model outperforms a standard tied triphone recognizer for Arabic speech, using Cohen´s kappa ration for statistical comparison. Finally we argue that models based on CV units may also lead to better stemmers, spell-checkers and other natural language processing tools for Arabic.
Keywords :
computational linguistics; hidden Markov models; natural language processing; speech recognition; statistical analysis; vocabulary; Arabic HMM; Arabic speech sounds; CV units; Cohen kappa ration; English language; European language; hidden Markov models; natural language processing tools; orthographic distributional phonological model; phonemic HMM; psycholinguistic model; silent sukuun vowel; small-vocabulary multispeaker CV HMM; speech recognizers; spell-checkers; spoken Arabic; standard tied triphone recognizer; statistical comparison; stemmers; syllabic Markov models; Accuracy; Hidden Markov models; Natural language processing; Speech; Speech recognition; Training; Vocabulary; Hidden Markov Models; kappa ratio; speech recognizers; syllabic components; testing speaker independence; tied triphone models;
Conference_Titel :
Information Science and Technology (CIST), 2014 Third IEEE International Colloquium in
Conference_Location :
Tetouan
Print_ISBN :
978-1-4799-5978-5
DOI :
10.1109/CIST.2014.7016628