• DocumentCode
    312281
  • Title

    Automatic generation of prosodic structure for high quality Mandarin speech synthesis

  • Author

    Chou, Fu-Chiang ; Tseng, Chiu-Yu ; Lee, Lin-shan

  • Author_Institution
    Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • Volume
    3
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    1624
  • Abstract
    A key problem for today´s speech synthesis technology is to automatically generate an appropriate hierarchical prosodic structure for text input and incorporate it into synthesized speech. The paper presents a method for such a problem in Mandarin Chinese. This method uses a speech database for the training of a statistical model to generate the prosodic structure and determine prosodic parameters such as syllable duration, pause, energy and intonation. The experimental results show that an accuracy of 83.1% in the prediction of prosodic structure can be achieved. Furthermore, a Chinese text-to-speech system can be developed based on the proposed prosodic structure
  • Keywords
    natural languages; speech synthesis; statistical analysis; Chinese text-to-speech system; automatic hierarchical prosodic structure generation; energy; high quality Mandarin speech synthesis; intonation; pause; speech database; statistical model training; syllable duration; synthesized speech; text input; Appropriate technology; History; Information science; Labeling; Neural networks; Predictive models; Spatial databases; Speech synthesis; Tagging; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607935
  • Filename
    607935