• DocumentCode
    2869927
  • Title

    A novel lip localization method based on shiftable wavelets transform

  • Author

    Yanjun, XU ; Limin, Du ; Ziqiang, Hou

  • Author_Institution
    Inst. of Acoust., Acad. Sinica, Beijing, China
  • Volume
    2
  • fYear
    1998
  • fDate
    1998
  • Firstpage
    1029
  • Abstract
    Visual feature extraction is one of the most important techniques in audiovisual bimodal speech recognition, and also remains a very challenging area in image understanding. A shiftable multiscale transform is introduced into the construction of an active shape model. It uses the pyramidal data to describe the structure of an image, which is invariant to illumination and perspective variability and thus contributes a lot to the improvement of the robustness of the model. A segmental downhill simplex method is also put forward to improve the minimization procedure of lip localization. It employs a kind of “coarse-to-fine” strategy to speed up the convergence and improve the robustness of lip localization. Experiments support the validity of the new method, and show better robustness and higher efficiency
  • Keywords
    computational geometry; convergence of numerical methods; feature extraction; image resolution; minimisation; speech recognition; wavelet transforms; active shape model construction; audiovisual bimodal speech recognition; convergence; image structure; image understanding; lip localization; minimization; pyramidal data; segmental downhill simplex method; shiftable multiscale transform; shiftable wavelet transform; visual feature extraction; Acoustic waves; Active shape model; Covariance matrix; Deformable models; Eigenvalues and eigenfunctions; Feature extraction; Lighting; Noise robustness; Speech; Wavelet transforms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Proceedings, 1998. ICSP '98. 1998 Fourth International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    0-7803-4325-5
  • Type

    conf

  • DOI
    10.1109/ICOSP.1998.770790
  • Filename
    770790