DocumentCode :
3412428
Title :
Control of fundamental frequency contours using the generation process model in HMM-based speech synthesis
Author :
Matsuda, Tetsuya ; Hirose, Keikichi ; Minematsu, Nobuaki
Author_Institution :
Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
fYear :
2010
fDate :
24-28 Oct. 2010
Firstpage :
617
Lastpage :
620
Abstract :
A method is proposed to increase the naturalness of prosody generated by speech synthesis based on hidden Markov models (HMMs). The method adds a constraint to the fundamental frequency contours (F0 contours) during HMM-based speech synthesis. The constraint adopted is the generation process model of F0 contours (F0 model). The method first extracts the F0 model parameters from the original F0 contour (generated by HMM-based speech synthesis) and then optimizes them successively by referring to the pre-trained HMMs. The experimental results show that the proposed method can improve naturalness when the F0 control by the original method is inadequate.
Keywords :
hidden Markov models; speech synthesis; F0 control; fundamental frequency contours; generation process model; Context modeling; Optimization; Smoothing methods; Speech; F0 contour; HMM-based speech synthesis
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Signal Processing (ICSP), 2010 IEEE 10th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-5897-4
Type :
conf
DOI :
10.1109/ICOSP.2010.5656358
Filename :
5656358