DocumentCode :
454651
Title :
Applying Pitch Target Model to Convert F0 Contour for Expressive Mandarin Speech Synthesis
Author :
Kang, Yongguo ; Tao, Jianhua ; Xu, Bo
Author_Institution :
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, ygkang@nlpr.ia.ac.cn
Volume :
1
fYear :
2006
fDate :
14-19 May 2006
Abstract :
In the paper, pitch target model is employed to represent and convert F0 contour for synthesizing an emotional Mandarin speech from a neutral speech. Compared with conventional F0 transforming methods, the proposed method converts F0 patterns described by pitch target parameters rather than F0 contours themselves, and uses Gaussian Mixture Model(GMM) and Classification and Regression Trees (CART) methods to build mapping functions for well-chosen pitch target parameters. Other prosodic parameters such as duration and intensity are also converted. Listening tests prove that these converted speeches express corresponding emotional states.
Keywords :
Automation; Classification tree analysis; Electronic switching systems; Laboratories; Paper technology; Pattern recognition; Regression tree analysis; Speech synthesis; Technological innovation; Vegetation mapping;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1660125
Filename :
1660125
Link To Document :
بازگشت