DocumentCode :
3164484
Title :
An F0 modeling technique based on prosodic events for spontaneous speech synthesis
Author :
Koriyama, Tomoki ; Nose, Takashi ; Kobayashi, Takao
Author_Institution :
Interdiscipl. Grad. Sch. of Sci. & Eng., Tokyo Inst. of Technol., Yokohama, Japan
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4589
Lastpage :
4592
Abstract :
This paper proposes a technique for effective modeling of F0 contours using prosodic-event-based HMM units for HMM-based spontaneous speech synthesis. The modeling unit corresponds to one of prosodic event segments such as pitch falling by accent and pitch rising by boundary pitch movement (BPM). Since the prosodic events of one phrase are generally less frequent than the changes of phonemes, the proposed unit is expected to reduce the number of model parameters of F0, which leads to robust parameter estimation. The objective and subjective experiments using spontaneous conversational speech data show that the proposed technique can significantly reduce the number of model parameters while keeping the naturalness of the synthetic speech.
Keywords :
hidden Markov models; speech synthesis; BPM; F0 modeling technique; accent; boundary pitch movement; hidden Markov model; objective experiments; pitch falling; pitch rising; prosodic-event-based HMM units; robust parameter estimation; spontaneous speech synthesis; subjective experiments; Context; Correlation; Hidden Markov models; Nose; Speech; Speech synthesis; Timing; F0 modeling; HMM-based speech synthesis; Prosodic events; Spontaneous speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288940
Filename :
6288940
Link To Document :
بازگشت