DocumentCode
2246536
Title
A multiple deformable template approach for visual speech recognition
Author
Chandramohan, Devi ; Silsbee, Peter L.
Author_Institution
Dept. of Electr. & Comput. Eng., Old Dominion Univ., Norfolk, VA, USA
Volume
1
fYear
1996
fDate
3-6 Oct 1996
Firstpage
50
Abstract
Proposes an improved deformable template algorithm for modeling the shape of a talker´s mouth. We use a two-step approach which begins by classifying mouth images into broad categories. The classification procedure yields both a set of template parameters (in effect, a unique template) and a set of initial conditions. The second step is to allow the deformable template to converge using standard techniques. The multi-model approach is significantly more flexible than single-model approaches and consistently provides better solutions. We present examples of single and multiple template solutions which support this statement. In a small recognition experiment, recognition of consonants improved from 16% to 33%, based only on visual information, when multiple templates were used
Keywords
image classification; speech recognition; consonant recognition; convergence; initial conditions; mouth image classification; multi-model approach; multiple deformable template approach; talker´s mouth shape modelling; template parameters; visual information; visual speech recognition; Constraint optimization; Convergence; Deformable models; Image analysis; Image converters; Image segmentation; Mouth; Shape; Solid modeling; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.607022
Filename
607022
Link To Document