DocumentCode
699311
Title
Audiovisual text-to-cued speech synthesis
Author
Gibert, Guillaume ; Bailly, Gerard ; Elisei, Frederic ; Beautemps, Denis ; Brun, Remi
Author_Institution
Inst. de la Commun. Parlee, Grenoble, France
fYear
2004
fDate
6-10 Sept. 2004
Firstpage
1007
Lastpage
1010
Abstract
We present here our efforts for characterizing the 3D movements of the right hand and the face of a French female during the production of manual cued speech. We analyzed the 3D trajectories of 50 hand and 63 facial fleshpoints during the production of 238 utterances carefully designed for covering all possible diphones of the French language. Linear and non linear statistical models of the hand and face deformations and postures have been developed using both separate and joint corpora. We implement a concatenative audiovisual text-to-cued speech synthesis system.
Keywords
speech synthesis; statistical analysis; 3D trajectory; French female; French language; concatenative audiovisual text-to-cued speech synthesis system; diphones; face deformation; joint corpora; linear statistical model; nonlinear statistical model; Abstracts; Face; Joints; Production; Quantization (signal); Speech synthesis; Three-dimensional displays;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2004 12th European
Conference_Location
Vienna
Print_ISBN
978-320-0001-65-7
Type
conf
Filename
7079841
Link To Document