DocumentCode
975504
Title
Recognition of visual speech elements using adaptively boosted hidden Markov models
Author
Foo, Say Wei ; Lian, Yong ; Dong, Liang
Author_Institution
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore
Volume
14
Issue
5
fYear
2004
fDate
5/1/2004 12:00:00 AM
Firstpage
693
Lastpage
705
Abstract
The performance of automatic speech recognition (ASR) system can be significantly enhanced with additional information from visual speech elements such as the movement of lips, tongue, and teeth, especially under noisy environment. In this paper, a novel approach for recognition of visual speech elements is presented. The approach makes use of adaptive boosting (AdaBoost) and hidden Markov models (HMMs) to build an AdaBoost-HMM classifier. The composite HMMs of the AdaBoost-HMM classifier are trained to cover different groups of training samples using the AdaBoost technique and the biased Baum-Welch training method. By combining the decisions of the component classifiers of the composite HMMs according to a novel probability synthesis rule, a more complex decision boundary is formulated than using the single HMM classifier. The method is applied to the recognition of the basic visual speech elements. Experimental results show that the AdaBoost-HMM classifier outperforms the traditional HMM classifier in accuracy, especially for visemes extracted from contexts.
Keywords
gesture recognition; hidden Markov models; pattern classification; probability; speech recognition; video signal processing; Baum-Welch training method; adaptively boosted hidden Markov models; automatic lip reading; automatic speech recognition systems; computational complexity; decision boundary; probability synthesis rule; spread-out distribution; time series distribution; visemes identification; visual speech element recognition; visual speech processing; Automatic speech recognition; Boosting; Hidden Markov models; Lips; Speech enhancement; Speech recognition; Speech synthesis; Teeth; Tongue; Working environment noise;
fLanguage
English
Journal_Title
Circuits and Systems for Video Technology, IEEE Transactions on
Publisher
ieee
ISSN
1051-8215
Type
jour
DOI
10.1109/TCSVT.2004.826773
Filename
1294960
Link To Document