DocumentCode
3317845
Title
Lip movement synthesis in audio-visual speech recognition system
Author
Li, Junquan ; Yin, Yixin
Author_Institution
Univ. of Sci. & Technol., Beijing, China
fYear
2005
fDate
30 Oct.-1 Nov. 2005
Firstpage
461
Lastpage
465
Abstract
This paper describes a technique for synthesizing audio-visual speech based on a hidden Markov model (HMM). The "Lip movement synthesis" experiment uses VC++ and the HTK toolkit. In the training stage, Japanese words are converted to text from the image sequence using the HMM. Experimental results show that the synthetic lip image sequence is smooth and realistic. This research was supported by NSFC, China (Grant No. 60374032).
Keywords
audio-visual systems; hidden Markov models; image sequences; natural languages; speech recognition; HMM; HTK toolkit; VC++; audio-visual speech recognition system; hidden Markov model; image sequence; lip movement synthesis; Application software; Control systems; Error correction; Hidden Markov models; Humans; Image sequences; Labeling; Natural languages; Speech recognition; Speech synthesis
fLanguage
English
Publisher
IEEE
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN
0-7803-9361-9
Type
conf
DOI
10.1109/NLPKE.2005.1598781
Filename
1598781