DocumentCode :
394294
Title :
A Mandarin intonation prediction model that can output real pitch patterns
Author :
Pan, Neng-Huang ; Yu, Ming-shing ; Wu, Ming-Jer
Author_Institution :
Dept. of Appl. Math., Nat. Chung-Hsing Univ., Taichung, Taiwan
Volume :
1
fYear :
2003
fDate :
6-10 April 2003
Abstract :
In this paper we propose an intonation prediction model for Mandarin text-to-speech (TTS) systems. The model outputs real pitch patterns by selecting a suitable pattern from the training corpus, which is a new approach. The model has three advantages. (1) It improves the naturalness of the synthesized speech and receives higher scores in the subjective listening tests. (2) It is accurate: average errors of 0.425 ms and 0.457 ms were obtained for the inside and outside tests, respectively, and pattern errors of 0.128 ms and 0.129 ms were obtained for the inside and outside tests, respectively. We also found that the pattern error measure is consistent with human hearing. (3) The training corpus need not be very large, which relieves the data sparsity problem.
Keywords :
pattern recognition; speech processing; speech synthesis; Mandarin intonation prediction model; TTS systems; accuracies; data sparsity problem; human hearing; pattern error measurement; real pitch patterns; subjective listening tests; synthesized speech naturalness; training corpus; Auditory system; Electronic mail; Humans; Mathematical model; Mathematics; Predictive models; Signal synthesis; Speech synthesis; Testing; Text analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1198826
Filename :
1198826