• DocumentCode
    1516737
  • Title
    An Image-Based Visual Speech Animation System
  • Author
    Zhou, Ziheng; Zhao, Guoying; Guo, Yimo; Pietikäinen, Matti
  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Oulu, Oulu, Finland
  • Volume
    22
  • Issue
    10
  • fYear
    2012
  • Firstpage
    1420
  • Lastpage
    1432
  • Abstract
    An image-based visual speech animation system is presented in this paper. A video model is proposed to preserve the video dynamics of a talking face. The model represents a video sequence by a low-dimensional continuous curve embedded in a path graph and establishes a map from the curve to the image domain. When selecting video segments for synthesis, we relax the traditional requirement of using the triphone as the unit, allowing segments to contain longer natural talking motion. Dense videos are sampled from the segments, concatenated, and downsampled to train a video model that enables efficient time alignment and motion smoothing for the final video synthesis. Different viseme definitions are used to investigate the impact of visemes on the video realism of the animated talking face. The system is built on a public database and tested both objectively and subjectively.
  • Keywords
    computer animation; face recognition; graph theory; image segmentation; image sequences; video signal processing; visual databases; animated talking face; image segmentation; image-based visual speech animation system; low-dimensional continuous curve; motion smoothing; natural talking motion; path graph; public database; time alignment; triphone; video dynamics; video realism; video sequence; video synthesis; viseme definitions; Animation; Databases; Face; Hidden Markov models; Image segmentation; Speech; Vectors; Graph representation; lip-syncing; talking face; video-realistic; visual speech animation (VSA)
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems for Video Technology, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1051-8215
  • Type
    jour
  • DOI
    10.1109/TCSVT.2012.2199399
  • Filename
    6200317