• DocumentCode
    436389
  • Title

    Parameter selection for prosodic modelling in a restricted-domain spanish text-to-speech system

  • Author

    Montero, J.M. ; de Cordoba, R. ; Macias-Guarasa, Javier ; San-Segundo, R. ; Gutierrez-Arriola, J. ; Pardo, J.M.

  • Author_Institution
    Speech Technology Group, Electronic Engineering Dept., Universidad Politecnica de Madrid, E.T.S.I. Telecomunicacion, Ciudad Universitaria, 28040-Madrid, Spain
  • Volume
    18
  • fYear
    2004
  • fDate
    June 28 2004-July 1 2004
  • Firstpage
    93
  • Lastpage
    98
  • Abstract
    The prosodic modeling is one of the most important tasks for developing a new text-to-speech synthesizer, especially in a female-voice high-quality restricted-domain system. Our double objective is to get accurate predictors for both the F0 curve and phoneme duration by minimizing the model estimation error in a Spanish text-to-speech system. To achieve these complementary aims we needed to find the factors that most influence prosodic values in a given language. We have used neural networks and experimented with the different combinations of parameters that yield the minimum error in the estimation. In the restricted-domain environment the variation in the different patterns is reduced, and there are more instances of each parameter vector in the database. This way, the neural network proves to be an excellent tool for the modeling. The resulting system predicts prosody with very good results (for duration: 15.5 in in RMS and a correlation factor of 0.8975; for F0: 19.80 Hz in RMS and a relative RMS error of 0.43) that clearly improves our previous rule-based system.
  • Keywords
    Artificial neural networks; Databases; Decision trees; Knowledge based systems; Natural languages; Neural networks; Speech synthesis; Telecommunications; Timing; F0 modeling; Prosody; artificial neural networks; duration modeling; parameter coding; parameter selection; text-to-speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automation Congress, 2004. Proceedings. World
  • Conference_Location
    Seville
  • Print_ISBN
    1-889335-21-5
  • Type

    conf

  • Filename
    1441024