DocumentCode
312281
Title
Automatic generation of prosodic structure for high quality Mandarin speech synthesis
Author
Chou, Fu-Chiang ; Tseng, Chiu-Yu ; Lee, Lin-shan
Author_Institution
Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Volume
3
fYear
1996
fDate
3-6 Oct 1996
Firstpage
1624
Abstract
A key problem for today´s speech synthesis technology is to automatically generate an appropriate hierarchical prosodic structure for text input and incorporate it into synthesized speech. The paper presents a method for such a problem in Mandarin Chinese. This method uses a speech database for the training of a statistical model to generate the prosodic structure and determine prosodic parameters such as syllable duration, pause, energy and intonation. The experimental results show that an accuracy of 83.1% in the prediction of prosodic structure can be achieved. Furthermore, a Chinese text-to-speech system can be developed based on the proposed prosodic structure
Keywords
natural languages; speech synthesis; statistical analysis; Chinese text-to-speech system; automatic hierarchical prosodic structure generation; energy; high quality Mandarin speech synthesis; intonation; pause; speech database; statistical model training; syllable duration; synthesized speech; text input; Appropriate technology; History; Information science; Labeling; Neural networks; Predictive models; Spatial databases; Speech synthesis; Tagging; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.607935
Filename
607935
Link To Document