Title :
On the prediction of global F0 shape for Japanese text-to-speech
Author :
Sagisaka, Yoshinori
Author_Institution :
ATR Interpreting Telephony Res. Lab., Kyoto, Japan
Abstract :
The global F0 shape of Japanese speech is predicted by phrasal accent attributes and adjacent phrasal environment using three-layered neural nets. Three F0 values of each minor phrase are used for the global shape description, and their prediction is carried out in each major phrase determined by right-branching syntactic boundaries. Through prediction experiments using short and ordinary sentence samples, it is quantitatively confirmed that the global F0 shapes are predicted fairly well in both samples and that additional controls are necessary for finer prediction in the ordinary sentence samples
Keywords :
filtering and prediction theory; neural nets; speech synthesis; Japanese text-to-speech; adjacent phrasal environment; global F0 shape; minor phrase; ordinary sentence samples; phrasal accent attributes; right-branching syntactic boundaries; sentence samples; shape description; three-layered neural nets; Control system synthesis; Error correction; Neural networks; Predictive models; Shape control; Speech; Speech synthesis; Telephony;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
Conference_Location :
Albuquerque, NM
DOI :
10.1109/ICASSP.1990.115662