Lip movement synthesis in audio-visual speech recognition system

Author

Li, Junquan ; Yin, Yixin

Author_Institution

Univ. of Sci. & Technol., Beijing, China

fYear

2005

fDate

30 Oct.-1 Nov. 2005

Firstpage

461

Lastpage

465

Abstract

This paper describes a technique for synthesizing audio-visual speech based on HMM. The experiment "Lip movement synthesis" use VC++ and HTK toolkit. In the training stage, Japanese words will be change to text from the image sequence from using hidden Markov model (HMM). Experimental results show that the synthetic lip image sequence is smooth and realistic. This research was supported by NSFC, China (Grant No.60374032).

Keywords

audio-visual systems; hidden Markov models; image sequences; natural languages; speech recognition; HMM; HTK toolkit; VC++; audio-visual speech recognition system; hidden Markov model; image sequence; lip movement synthesis; Application software; Control systems; Error correction; Hidden Markov models; Humans; Image sequences; Labeling; Natural languages; Speech recognition; Speech synthesis;

fLanguage

English

Publisher

ieee

Conference_Titel

Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on

Print_ISBN

0-7803-9361-9

Type

conf

DOI

10.1109/NLPKE.2005.1598781

Filename

1598781

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3317845