Title :
Auditive learning based Chinese F0 prediction
Author :
Tao, Jianhua ; Xing, Ni
Author_Institution :
Nat. Lab of Pattern Recognition, Chinese Acad. of Sci., Beijing, China
Abstract :
This paper describes a new F0 model based on an auditive learning (AL) method. Focusing on the notion of prosody templates, we confirm that F0 patterns for a syllable can be extracted from the various anamorphoses of F0 contours in spontaneous speech. F0 template selection with a prosody cost function (PCF) is well suited to Chinese F0 prediction. Furthermore, the AL method is used to adjust the weighting of the PCF dynamically in application. Unlike other methods, the approach can give feedback on exactly which parameters are crucial in determining the successful choice of patterns. The paper also analyzes the error distribution of the F0 prediction results. Both smoothing tests and F0 range tests show that the synthesis results are very close to human speech.
Keywords :
natural languages; speech processing; speech synthesis; Chinese F0 prediction; anamorphosis; auditive learning method; error distribution; prosody cost function; prosody templates; spontaneous speech; Clustering methods; Cost function; Databases; Decision trees; Error analysis; Humans; Smoothing methods; Speech synthesis; Stress; Testing;
Conference_Titel :
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN :
0-7803-7965-9
DOI :
10.1109/ICME.2003.1221286