Title :
Decision tree based duration prediction in Mandarin
Author :
Qing Quo ; Katae, Nobuyuki
Author_Institution :
Fujitsu R&D Center China, Beijing, China
fDate :
30 Oct.-1 Nov. 2005
Abstract :
This paper reports the methodology and results of decision tree based duration prediction for Mandarin text-to-speech system developed by the FUJITSU Laboratories. Syllable initials and finals are the basic units in our duration study. In this paper, the factors that influence the finals such as phrase boundary and phone context are discussed in detail. Experiments show that the prosodic factor of whether the right phrase boundary level is prosodic word or higher level is the most important determinant of duration. Furthermore, the degree of phrase boundary vowel lengthening may vary depending on the different kinds of finals. And this paper also explains the methods for objective evaluation of the performance of the duration prediction model.
Keywords :
decision trees; natural languages; speech processing; speech synthesis; FUJITSU Laboratories; Mandarin text-to-speech system; decision tree based duration prediction model; phone context; phrase boundary level; phrase boundary vowel lengthening; prosodic factor; prosodic word; Additives; Databases; Decision trees; Degradation; Greedy algorithms; Laboratories; Natural languages; Predictive models; Speech processing; Speech synthesis;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
DOI :
10.1109/NLPKE.2005.1598736