• DocumentCode
    1819668
  • Title

    An online speech driven talking head system

  • Author

    Kai Zhao ; Zhiyong Wu ; Jia Jia ; Lianhong Cai

  • Author_Institution
    Tsinghua-CUHK Joint Res. Center for Media Sci. Technol., Tsinghua Univ., Shenzhen, China
  • fYear
    2012
  • fDate
    18-20 Nov. 2012
  • Firstpage
    186
  • Lastpage
    187
  • Abstract
    This paper presents the design and implementation of an online speech driven talking head animation system. The system first recognizes phoneme sequence from the input speech with a Chinese Mandarin speech recognizer. The phoneme sequence is further transformed to a sequence of visemes. The sequence of MPEG-4 facial animation parameters (FAPs) is further derived from the viseme sequence, and is used to drive the facial animations on a 3-dimentional talking head. The architecture and the major features are also presented in the paper, together with the evaluations of the system.
  • Keywords
    computer animation; natural language processing; speech recognition; speech synthesis; 3-dimentional talking head; Chinese Mandarin speech recognizer; FAP; MPEG-4 facial animation parameters; input speech; online speech driven talking head animation system; phoneme sequence; phoneme sequence recognition; viseme sequence; facial animation parameters (FAPs); talking head; viseme; visual speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Global High Tech Congress on Electronics (GHTCE), 2012 IEEE
  • Conference_Location
    Shenzhen
  • Print_ISBN
    978-1-4673-5086-0
  • Type

    conf

  • DOI
    10.1109/GHTCE.2012.6490153
  • Filename
    6490153