Visual speech recognition of Modern Classic Arabic language

Author

Damien, Pascal

fYear

2011

fDate

6-7 June 2011

Firstpage

50

Lastpage

55

Abstract

Viseme-based Visual Speech Recognition (VSR) systems that use Hidden Markov Models (HMMs) for phoneme recognition generally employ a 3-state left-right HMM for each viseme to be recognized. In this article, we propose a novel approach that introduces a consonant-vowel detector and uses two classifiers: an HMM-based classifier for the recognition of the “consonant part” of the phoneme and a classifier for the “vowel part”. The benefits of this approach include (1) reducing the number of hidden states and (2) reducing the number of HMMs. We tested our method on a limited set of words of the Modern Classic Arabic language and achieved a recognition rate of 81.7%. Moreover, the proposed model is speaker-independent and uses visemes as its basic units, which makes it applicable to word sets of any size or content.

Keywords

hidden Markov models; natural language processing; speech recognition; HMM based classifier; VSR system; consonant part recognition; consonant-vowel detector; hidden Markov model; modern classic Arabic language; phoneme recognition; recognition rate; viseme-based visual speech recognition; vowel part recognition; Copper; Hidden Markov models; Lips; Mouth; Speech recognition; Visualization; Vocabulary; Arabic language; Viseme; Visual speech recognition

fLanguage

English

Publisher

IEEE

Conference_Titel

Humanities, Science & Engineering Research (SHUSER), 2011 International Symposium on

Conference_Location

Kuala Lumpur

Print_ISBN

978-1-4577-0263-1

Type

conf

DOI

10.1109/SHUSER.2011.6008499

Filename

6008499