DocumentCode
3137987
Title
Visual speech recognition of Modern Classic Arabic language
Author
Damien, Pascal
fYear
2011
fDate
6-7 June 2011
Firstpage
50
Lastpage
55
Abstract
Viseme-based Visual Speech Recognition (VSR) systems, using Hidden Markov Models (HMM) for phoneme recognition, generally use 3-state left-right HMM for each viseme to recognize. In this article, we propose a novel approach introducing a consonant-vowel detector and using two classifiers: an HMM based classifier for the recognition of the “consonant part” of the phoneme and a classifier for the “vowel part”. The benefits of such an approach include (1) reducing the number of hidden states and (2) reducing the number of HMMs. We tested our method on a limited set of words of the Modern Classic Arabic language and achieved a recognition rate of 81.7%. Moreover, the proposed model is speaker-independent and uses visemes as the basic units, thereby, making it applicable to any set of words of varying size or content.
Keywords
hidden Markov models; natural language processing; speech recognition; HMM based classifier; VSR system; consonant part recognition; consonant-vowel detector; hidden Markov model; modern classic Arabic language; phoneme recognition; recognition rate; viseme-based visual speech recognition; vowel part recognition; Copper; Hidden Markov models; Lips; Mouth; Speech recognition; Visualization; Vocabulary; Arabic language; Viseme; Visual speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Humanities, Science & Engineering Research (SHUSER), 2011 International Symposium on
Conference_Location
Kuala Lumpur
Print_ISBN
978-1-4577-0263-1
Type
conf
DOI
10.1109/SHUSER.2011.6008499
Filename
6008499
Link To Document