DocumentCode :
387504
Title :
Achieving real-time lip-synch via SVM-based phoneme classification and lip shape refinement
Author :
Kim, Taeyoon ; Kang, Yongsung ; Ko, Hanseok
Author_Institution :
Dept. of Electron. & Comput. Eng., Korea Univ., Seoul, South Korea
fYear :
2002
fDate :
2002
Firstpage :
299
Lastpage :
304
Abstract :
In this paper, we develop a real time lip-synch system that activates a 2D avatar\´s lip motion in synch with incoming speech utterance. To realize "real time" operation of the system, we contain the processing time by invoking a merge and split procedure performing coarse-to-fine phoneme classification. At each stage of phoneme classification, we apply a support vector machine (SVM) to constrain the computational load while attaining desirable accuracy. Coarse-to-fine phoneme classification is accomplished via 2 stages of feature extraction, where each speech frame is acoustically analyzed first for 3 classes of lip opening using MFCC as the feature and then a further refined classification for detailed lip shape using formant information. We implemented the system with 2D lip animation that shows the effectiveness of the proposed 2-stage procedure accomplishing the real-time lip-synch task.
Keywords :
acoustic signal processing; computer animation; feature extraction; learning automata; real-time systems; signal classification; speech recognition; 2D avatar lip motion; 2D lip animation; acoustic analysis; feature extraction; incoming speech utterance; lip opening; lip shape refinement; merge and split procedure; processing time; real time lip synch system; speech frame; support vector machine based coarse-to-fine phoneme classification; Animation; Feature extraction; Information analysis; Mel frequency cepstral coefficient; Multimedia systems; Real time systems; Shape; Speech analysis; Support vector machine classification; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimodal Interfaces, 2002. Proceedings. Fourth IEEE International Conference on
Print_ISBN :
0-7695-1834-6
Type :
conf
DOI :
10.1109/ICMI.2002.1167010
Filename :
1167010
Link To Document :
بازگشت