• DocumentCode
    323539
  • Title

    Automatic generation of synthesis units for trainable text-to-speech systems

  • Author

    Hon, H. ; Acero, A. ; Huang, X. ; Liu, J. ; Plumpe, M.

  • Author_Institution
    Microsoft Corp., Redmond, WA, USA
  • Volume
    1
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    293
  • Abstract
    The Whistler text-to-speech engine was designed so that we can automatically construct the model parameters from training data. This paper describes in detail the design issues of constructing the synthesis unit inventory automatically from speech databases. The automatic process includes (1) determining the scaleable synthesis unit which can reflect spectral variations of different allophones; (2) segmenting the recording sentences into phonetic segments; (3) select good instances for each synthesis unit to generate best synthesis sentence during the run time. These processes are all derived through the use of probabilistic learning methods which are aimed at the same optimization criteria. Through this automatic unit generation, Whistler can automatically produce synthetic speech that sounds very natural and resembles the acoustic characteristics of the original speaker
  • Keywords
    learning (artificial intelligence); optimisation; speech synthesis; Whistler text-to-speech engine; acoustic characteristics; allophones; automatic generation; automatic unit generation; design; optimization criteria; phonetic segments; probabilistic learning; recording sentences; scaleable synthesis unit; segmentation; spectral variations; speech databases; synthesis units; synthetic speech; trainable text-to-speech systems; Character generation; Databases; Engines; Humans; Learning systems; Loudspeakers; Optimization methods; Speech synthesis; Synthesizers; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.674425
  • Filename
    674425