DocumentCode :
3528960
Title :
CART-based modeling of Chinese tonal patterns with a functional model tracing the fundamental frequency trajectories
Author :
Ni, Jinfu ; Sakai, Shinsuke ; Shimizu, Tohru ; Nakamura, Satoshi
Author_Institution :
Nat. Inst. of Inf. & Commun. Technol.
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4253
Lastpage :
4256
Abstract :
We propose an approach to modeling Chinese tonal patterns, focusing on the basic fundamental frequency (F0) patterns characterized by the contextual linguistic features that can be directly extracted from text. We analyze tonal patterns as sparse target points (tonal F0 peaks and valleys) and represent them in parametric form within the framework of a functional F0 model. The relationships between the target points and underlying linguistic features are trained using classification and regression tree analysis (CARTs), and this functional model is used to trace the F0 trajectories when training the CARTs and to synthesize a tonal pattern from the target points predicted by the CARTs. Our experiments indicate that the proposed method has low F0 prediction errors. Utilization of the F0 ranges measured from training samples could significantly reduce the influences of differences in voice ranges on training a speaker-independent model. Furthermore, the most important roles in characterizing tonal patterns were played by a few linguistic features such as lexical tone context and the distinction between voiced from unvoiced initials.
Keywords :
learning (artificial intelligence); speech processing; speech synthesis; Chinese tonal patterns; Prosody modeling; cart-based modeling; contextual linguistic features; functional model tracing; fundamental frequency trajectories; machine learning; regression tree analysis; speaker-independent model; speech processing; speech synthesis; Classification tree analysis; Context modeling; Data mining; Frequency; Hidden Markov models; Natural languages; Pattern analysis; Predictive models; Regression tree analysis; Speech synthesis; Prosody modeling; functional F0 model; machine learning; speech processing; speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960568
Filename :
4960568
Link To Document :
بازگشت