DocumentCode :
2246536
Title :
A multiple deformable template approach for visual speech recognition
Author :
Chandramohan, Devi ; Silsbee, Peter L.
Author_Institution :
Dept. of Electr. & Comput. Eng., Old Dominion Univ., Norfolk, VA, USA
Volume :
1
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
50
Abstract :
Proposes an improved deformable template algorithm for modeling the shape of a talker´s mouth. We use a two-step approach which begins by classifying mouth images into broad categories. The classification procedure yields both a set of template parameters (in effect, a unique template) and a set of initial conditions. The second step is to allow the deformable template to converge using standard techniques. The multi-model approach is significantly more flexible than single-model approaches and consistently provides better solutions. We present examples of single and multiple template solutions which support this statement. In a small recognition experiment, recognition of consonants improved from 16% to 33%, based only on visual information, when multiple templates were used
Keywords :
image classification; speech recognition; consonant recognition; convergence; initial conditions; mouth image classification; multi-model approach; multiple deformable template approach; talker´s mouth shape modelling; template parameters; visual information; visual speech recognition; Constraint optimization; Convergence; Deformable models; Image analysis; Image converters; Image segmentation; Mouth; Shape; Solid modeling; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607022
Filename :
607022
Link To Document :
بازگشت