• DocumentCode
    302868
  • Title

    Lipreading from color motion video

  • Author

    Chiou, Greg I. ; Hwang, Jenq-Neng

  • Author_Institution
    Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
  • Volume
    4
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    2156
  • Abstract
    We have designed and implemented a lipreading system which recognises isolated words using only color motion video of human lips (without acoustic data). The lipreading system performs color motion video recognition using “snakes” (active contour models), principal component analysis (PCA), and hidden Markov models (HMM). The snake algorithm and PCA are used to extract two sets of visual features from every frame (image) in the video sequence. The snake algorithm looks for contour features in the geometric space, while PCA seeks principal components in the eigenspace. An HMM recognizer is used to train and recognise a sequence of the combined visual features. With the visual information alone, we were able to achieve 94% recognition accuracy for 10 isolated words of a single speaker without using any special marker or lipstick
  • Keywords
    feature extraction; hidden Markov models; image colour analysis; image sequences; motion estimation; speech recognition; video signal processing; HMM; HMM recognizer; active contour models; color motion video recognition; contour features; eigenspace; geometric space; hidden Markov models; isolated word recognition; lipreading system; principal component analysis; recognition accuracy; snake algorithm; snakes; video sequence; visual feature extraction; visual information; Acoustic noise; Active contours; Hidden Markov models; Humans; Lips; Noise cancellation; Principal component analysis; Signal processing; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.545743
  • Filename
    545743