• DocumentCode
    699311
  • Title

    Audiovisual text-to-cued speech synthesis

  • Author

    Gibert, Guillaume ; Bailly, Gerard ; Elisei, Frederic ; Beautemps, Denis ; Brun, Remi

  • Author_Institution
    Inst. de la Commun. Parlee, Grenoble, France
  • fYear
    2004
  • fDate
    6-10 Sept. 2004
  • Firstpage
    1007
  • Lastpage
    1010
  • Abstract
    We present here our efforts for characterizing the 3D movements of the right hand and the face of a French female during the production of manual cued speech. We analyzed the 3D trajectories of 50 hand and 63 facial fleshpoints during the production of 238 utterances carefully designed for covering all possible diphones of the French language. Linear and non linear statistical models of the hand and face deformations and postures have been developed using both separate and joint corpora. We implement a concatenative audiovisual text-to-cued speech synthesis system.
  • Keywords
    speech synthesis; statistical analysis; 3D trajectory; French female; French language; concatenative audiovisual text-to-cued speech synthesis system; diphones; face deformation; joint corpora; linear statistical model; nonlinear statistical model; Abstracts; Face; Joints; Production; Quantization (signal); Speech synthesis; Three-dimensional displays;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2004 12th European
  • Conference_Location
    Vienna
  • Print_ISBN
    978-320-0001-65-7
  • Type

    conf

  • Filename
    7079841