• DocumentCode
    2246536
  • Title

    A multiple deformable template approach for visual speech recognition

  • Author

    Chandramohan, Devi ; Silsbee, Peter L.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Old Dominion Univ., Norfolk, VA, USA
  • Volume
    1
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    50
  • Abstract
    Proposes an improved deformable template algorithm for modeling the shape of a talker´s mouth. We use a two-step approach which begins by classifying mouth images into broad categories. The classification procedure yields both a set of template parameters (in effect, a unique template) and a set of initial conditions. The second step is to allow the deformable template to converge using standard techniques. The multi-model approach is significantly more flexible than single-model approaches and consistently provides better solutions. We present examples of single and multiple template solutions which support this statement. In a small recognition experiment, recognition of consonants improved from 16% to 33%, based only on visual information, when multiple templates were used
  • Keywords
    image classification; speech recognition; consonant recognition; convergence; initial conditions; mouth image classification; multi-model approach; multiple deformable template approach; talker´s mouth shape modelling; template parameters; visual information; visual speech recognition; Constraint optimization; Convergence; Deformable models; Image analysis; Image converters; Image segmentation; Mouth; Shape; Solid modeling; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607022
  • Filename
    607022