• DocumentCode
    387504
  • Title

    Achieving real-time lip-synch via SVM-based phoneme classification and lip shape refinement

  • Author

    Kim, Taeyoon ; Kang, Yongsung ; Ko, Hanseok

  • Author_Institution
    Dept. of Electron. & Comput. Eng., Korea Univ., Seoul, South Korea
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    299
  • Lastpage
    304
  • Abstract
    In this paper, we develop a real time lip-synch system that activates a 2D avatar\´s lip motion in synch with incoming speech utterance. To realize "real time" operation of the system, we contain the processing time by invoking a merge and split procedure performing coarse-to-fine phoneme classification. At each stage of phoneme classification, we apply a support vector machine (SVM) to constrain the computational load while attaining desirable accuracy. Coarse-to-fine phoneme classification is accomplished via 2 stages of feature extraction, where each speech frame is acoustically analyzed first for 3 classes of lip opening using MFCC as the feature and then a further refined classification for detailed lip shape using formant information. We implemented the system with 2D lip animation that shows the effectiveness of the proposed 2-stage procedure accomplishing the real-time lip-synch task.
  • Keywords
    acoustic signal processing; computer animation; feature extraction; learning automata; real-time systems; signal classification; speech recognition; 2D avatar lip motion; 2D lip animation; acoustic analysis; feature extraction; incoming speech utterance; lip opening; lip shape refinement; merge and split procedure; processing time; real time lip synch system; speech frame; support vector machine based coarse-to-fine phoneme classification; Animation; Feature extraction; Information analysis; Mel frequency cepstral coefficient; Multimedia systems; Real time systems; Shape; Speech analysis; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimodal Interfaces, 2002. Proceedings. Fourth IEEE International Conference on
  • Print_ISBN
    0-7695-1834-6
  • Type

    conf

  • DOI
    10.1109/ICMI.2002.1167010
  • Filename
    1167010