Audiovisual text-to-cued speech synthesis

Author

Gibert, Guillaume ; Bailly, Gerard ; Elisei, Frederic ; Beautemps, Denis ; Brun, Remi

Author_Institution

Inst. de la Commun. Parlee, Grenoble, France

fYear

2004

fDate

6-10 Sept. 2004

Firstpage

1007

Lastpage

1010

Abstract

We present here our efforts for characterizing the 3D movements of the right hand and the face of a French female during the production of manual cued speech. We analyzed the 3D trajectories of 50 hand and 63 facial fleshpoints during the production of 238 utterances carefully designed for covering all possible diphones of the French language. Linear and non linear statistical models of the hand and face deformations and postures have been developed using both separate and joint corpora. We implement a concatenative audiovisual text-to-cued speech synthesis system.

Keywords

speech synthesis; statistical analysis; 3D trajectory; French female; French language; concatenative audiovisual text-to-cued speech synthesis system; diphones; face deformation; joint corpora; linear statistical model; nonlinear statistical model; Abstracts; Face; Joints; Production; Quantization (signal); Speech synthesis; Three-dimensional displays;

fLanguage

English

Publisher

ieee

Conference_Titel

Signal Processing Conference, 2004 12th European

Conference_Location

Vienna

Print_ISBN

978-320-0001-65-7

Type

conf

Filename

7079841

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=699311