Title :
A Mandarin intonation prediction model that can output real pitch patterns
Author :
Pan, Neng-Huang ; Yu, Ming-shing ; Wu, Ming-Jer
Author_Institution :
Dept. of Appl. Math., Nat. Chung-Hsing Univ., Taichung, Taiwan
Abstract :
In this paper we propose an intonation prediction model for Mandarin text-to-speech (TTS) systems. The model outputs real pitch patterns by selecting a suitable real pitch pattern from the training corpus, which is a new approach. Its advantages are as follows. (1) It improves the naturalness of the synthesized speech and receives higher scores in subjective listening tests. (2) It is accurate: the average errors were 0.425 ms and 0.457 ms, and the pattern errors were 0.128 ms and 0.129 ms, for the inside and outside tests, respectively. We also found that the pattern error measure agrees with human hearing. (3) The training corpus need not be very large, which relieves the data sparsity problem.
Keywords :
pattern recognition; speech processing; speech synthesis; Mandarin intonation prediction model; TTS systems; accuracies; data sparsity problem; human hearing; pattern error measurement; real pitch patterns; subjective listening tests; synthesized speech naturalness; training corpus; Auditory system; Electronic mail; Humans; Mathematical model; Mathematics; Predictive models; Signal synthesis; Speech synthesis; Testing; Text analysis;
Conference_Title :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198826