Title :
An F0 modeling technique based on prosodic events for spontaneous speech synthesis
Author :
Koriyama, Tomoki ; Nose, Takashi ; Kobayashi, Takao
Author_Institution :
Interdiscipl. Grad. Sch. of Sci. & Eng., Tokyo Inst. of Technol., Yokohama, Japan
Abstract :
This paper proposes a technique for effective modeling of F0 contours using prosodic-event-based HMM units for HMM-based spontaneous speech synthesis. The modeling unit corresponds to one of prosodic event segments such as pitch falling by accent and pitch rising by boundary pitch movement (BPM). Since the prosodic events of one phrase are generally less frequent than the changes of phonemes, the proposed unit is expected to reduce the number of model parameters of F0, which leads to robust parameter estimation. The objective and subjective experiments using spontaneous conversational speech data show that the proposed technique can significantly reduce the number of model parameters while keeping the naturalness of the synthetic speech.
Keywords :
hidden Markov models; speech synthesis; BPM; F0 modeling technique; accent; boundary pitch movement; hidden Markov model; objective experiments; pitch falling; pitch rising; prosodic-event-based HMM units; robust parameter estimation; spontaneous speech synthesis; subjective experiments; Context; Correlation; Hidden Markov models; Nose; Speech; Speech synthesis; Timing; F0 modeling; HMM-based speech synthesis; Prosodic events; Spontaneous speech;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288940