Title :
Corpus-based Mandarin speech synthesis with contextual syllabic units based on phonetic properties
Author :
Chou, Fu-chiang ; Tseng, Chiu-Yu
Author_Institution :
Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Abstract :
This paper describes an improved concatenative synthesis module for a Chinese text-to-speech system. The concatenated segments are on-line selected from a designed speech corpus that is precisely segmented with an improved version of HMMs. The selection criteria are the prosodic and contextual similarities between the units and the desired targets from the previous module of the TTS system. The TD-PSOLA modifies the prosodic parameters of the selected units, and three methods for unit concatenation are performed according to the types of the syllabic junctures. These types are classified with the knowledge from the phonetic observations of large amounts of speech data. The output speech is remarkably fluent and natural because the coarticulation effects cross syllabic boundaries are well modeled and less prosodic modification is needed for the TD-PSOLA
Keywords :
acoustic signal processing; hidden Markov models; natural languages; speech synthesis; Chinese text-to-speech system; HMM; TD-PSOLA; automatic segmentation; coarticulation effects; contextual similarities; contextual syllabic units; corpus-based Mandarin speech synthesis; cross syllabic boundaries; output speech; phonetic observations; phonetic properties; prosodic parameters; prosodic similarities; speech data; speech recognition; syllabic junctures; unit concatenation; Concatenated codes; Databases; Hidden Markov models; Humans; Labeling; Natural languages; Sections; Speech synthesis; Synthesizers;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.675409