• DocumentCode
    327652
  • Title
    Accurate, real-time, unadorned lip tracking
  • Author
    Kaucic, Robert ; Blake, Andrew
  • Author_Institution
    Dept. of Eng. Sci., Oxford Univ., UK
  • fYear
    1998
  • fDate
    4-7 Jan 1998
  • Firstpage
    370
  • Lastpage
    375
  • Abstract
    Human speech is inherently multi-modal, consisting of both audio and visual components. Recently, researchers have shown that incorporating information about the position of the lips into acoustic speech recognisers enables robust recognition of noisy speech. In the case of Hidden Markov Model-recognition, we show that this happens because the visual signal stabilises the alignment of states. It is also shown that unadorned lips, both the inner and outer contours, can be robustly tracked in real time on general-purpose workstations. To accomplish this, efficient algorithms are employed which contain three key components: shape models, motion models, and focused colour feature detectors, all of which are learnt from examples.
  • Keywords
    image recognition; motion estimation; speech recognition; Hidden Markov Model-recognition; acoustic speech recognisers; colour feature detectors; lip tracking; motion models; noisy speech; robust recognition; shape models; visual signal; Acoustic noise; Computer vision; Hidden Markov models; Humans; Lips; Motion detection; Robustness; Shape; Speech recognition; Workstations
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Vision, 1998. Sixth International Conference on
  • Conference_Location
    Bombay
  • Print_ISBN
    81-7319-221-9
  • Type
    conf
  • DOI
    10.1109/ICCV.1998.710745
  • Filename
    710745