• DocumentCode
    457486
  • Title

    Automatic Lipreading with Limited Training Data

  • Author

    Wang, S.L. ; Lau, W.H. ; Liew, A.W.C. ; Leung, S.H.

  • Author_Institution
    Sch. of Info. Security Eng., Shanghai Jiaotong Univ.
  • Volume
    3
  • fYear
    0
  • fDate
    0-0 0
  • Firstpage
    881
  • Lastpage
    884
  • Abstract
    Speech recognition solely based on visual information such as the lip shape and its movement is referred to as lipreading. This paper presents an automatic lipreading technique for speaker dependent (SD) and speaker independent (SI) speech recognition tasks. Since the visual features are derived according to the frame rate of the video sequence, spline representation is then employed to translate the discrete-time sampled visual features into continuous domain. The spline coefficients in the same word class are constrained to have similar expression and can be estimated from the training data by the EM algorithm. In addition, an adaptive multi-model approach is proposed to overcome the variation caused by different speaking style in speaker-independent recognition task. The experiments are carried out to recognize the ten English digits and an accuracy of 96% for speaker dependent recognition and 88% for speaker independent recognition have been achieved, which shows the superiority of our approach compared with other classifiers investigated
  • Keywords
    expectation-maximisation algorithm; speech recognition; splines (mathematics); video signal processing; EM algorithm; automatic lipreading; speaker dependent; speaker independent; speech recognition; spline representation; video sequence; visual features; visual information; Automatic speech recognition; Data mining; Data security; Hidden Markov models; Information security; Shape; Speech recognition; Spline; Training data; Video sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1051-4651
  • Print_ISBN
    0-7695-2521-0
  • Type

    conf

  • DOI
    10.1109/ICPR.2006.301
  • Filename
    1699666