• DocumentCode
    3558891
  • Title

    A Low-Complexity Parabolic Lip Contour Model With Speaker Normalization for High-Level Feature Extraction in Noise-Robust Audiovisual Speech Recognition

  • Author

    Borgstr?¶m, Bengt Jonas ; Alwan, Abeer

  • Author_Institution
    Dept. of Electr. Eng., Univ. of California, Los Angeles, CA
  • Volume
    38
  • Issue
    6
  • fYear
    2008
  • Firstpage
    1273
  • Lastpage
    1280
  • Abstract
    This paper proposes a novel low-complexity lip contour model for high-level optic feature extraction in noise-robust audiovisual (AV) automatic speech recognition systems. The model is based on weighted least-squares parabolic fitting of the upper and lower lip contours, does not require the assumption of symmetry across the horizontal axis of the mouth, and is therefore realistic. The proposed model does not depend on the accurate estimation of specific facial points, as do other high-level models. Also, we present a novel low-complexity algorithm for speaker normalization of the optic information stream, which is compatible with the proposed model and does not require parameter training. The use of the proposed model with speaker normalization results in noise robustness improvement in AV isolated-word recognition relative to using the baseline high-level model.
  • Keywords
    feature extraction; speech recognition; high-level optic feature extraction; isolated word recognition; low-complexity algorithm; low-complexity parabolic lip contour model; lower lip contours; noise robustness; noise-robust audiovisual automatic speech recognition; optic information stream; speaker normalization; weighted least-squares parabolic fitting; Audio-visual speech recognition; feature extraction; noise-robust speech recognition; weighted least-squares;
  • fLanguage
    English
  • Journal_Title
    Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4427
  • Type

    jour

  • DOI
    10.1109/TSMCA.2008.2003486
  • Filename
    4652747