DocumentCode :
3137987
Title :
Visual speech recognition of Modern Classic Arabic language
Author :
Damien, Pascal
fYear :
2011
fDate :
6-7 June 2011
Firstpage :
50
Lastpage :
55
Abstract :
Viseme-based Visual Speech Recognition (VSR) systems, using Hidden Markov Models (HMM) for phoneme recognition, generally use 3-state left-right HMM for each viseme to recognize. In this article, we propose a novel approach introducing a consonant-vowel detector and using two classifiers: an HMM based classifier for the recognition of the “consonant part” of the phoneme and a classifier for the “vowel part”. The benefits of such an approach include (1) reducing the number of hidden states and (2) reducing the number of HMMs. We tested our method on a limited set of words of the Modern Classic Arabic language and achieved a recognition rate of 81.7%. Moreover, the proposed model is speaker-independent and uses visemes as the basic units, thereby, making it applicable to any set of words of varying size or content.
Keywords :
hidden Markov models; natural language processing; speech recognition; HMM based classifier; VSR system; consonant part recognition; consonant-vowel detector; hidden Markov model; modern classic Arabic language; phoneme recognition; recognition rate; viseme-based visual speech recognition; vowel part recognition; Copper; Hidden Markov models; Lips; Mouth; Speech recognition; Visualization; Vocabulary; Arabic language; Viseme; Visual speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Humanities, Science & Engineering Research (SHUSER), 2011 International Symposium on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4577-0263-1
Type :
conf
DOI :
10.1109/SHUSER.2011.6008499
Filename :
6008499
Link To Document :
بازگشت