• DocumentCode
    1703834
  • Title
    A Macro-Prosodic Indications for a Romanian TtS System Based on the Functional Intonational Model
  • Author
    Apopei, Vasile ; Jitca, Doina ; Paduraru, Otilia
  • Author_Institution
    Inst. of Comput. Sci., Iasi, Romania
  • fYear
    2012
  • Firstpage
    186
  • Lastpage
    190
  • Abstract
    This paper presents how macro-prosodic indications can be used within our TtS system to drive the prosody prediction module (PPM) in generating a target intonational contour for the synthesized utterance. The previous variant of the PPM generates melodic contour descriptions using only the implicit prosodic indications deduced from text analysis. The explicit indications are edited in the input text window and consist of group markers applied to the text; each group can be assigned a focus attribute with an indication of its strength. A group can also be assigned a tonal prominence attribute (related to the tonal level of the highest peak in the group). We call these the macro-prosodic indications. We also define a microprosodic indication: the pitch accent type attribute at the accentual unit level, which is related to the F0 contour pattern. The values of this attribute are taken from the Ro-ToBI label set. Both macro- and microprosodic indications elicit certain intonational variants at the speech synthesis output and can also improve the understanding of Romanian intonation contours.
  • Keywords
    natural language processing; speech processing; speech synthesis; text analysis; F0 contour pattern; PPM; Ro-ToBI label set; Romanian TtS system; Romanian intonation contour understanding; accentual unit level; intonational variants; macro-prosodic indications; melodic contour description generation; prosody prediction module; speech synthesis output; synthesized utterance; target intonational contour generation; text-to-speech system; text analysis; Acoustics; Computer science; Dictionaries; Gold; IP networks; Pragmatics; Speech; F0 contour generation; macroprosodic indication; tonal space; utterance tree;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Emerging Security Technologies (EST), 2012 Third International Conference on
  • Conference_Location
    Lisbon
  • Print_ISBN
    978-1-4673-2448-9
  • Type
    conf
  • DOI
    10.1109/EST.2012.48
  • Filename
    6328109