DocumentCode
3412428
Title
Control of fundamental frequency contours using the generation process model in HMM-based speech synthesis
Author
Matsuda, Tetsuya ; Hirose, Keikichi ; Minematsu, Nobuaki
Author_Institution
Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
fYear
2010
fDate
24-28 Oct. 2010
Firstpage
617
Lastpage
620
Abstract
A method was proposed to increase the naturalness of prosody generated with speech synthesis based on hidden Markov models (HMMs). This method adds a constraint to the fundamental frequency contours (F0 contours) during the HMM-based speech synthesis. The constraint adopted is the generation process model of F0 contours (F0 model). The method first extracts the F0 model parameters from the original Fo contour (generated by the HMM-based speech synthesis) and then optimizes them successively by referring to the pre-trained HMMs. The experimental results show that the proposed method can improve naturalness when the F0 control by the original method is inadequate.
Keywords
hidden Markov models; speech synthesis; F0 control; fundamental frequency contours; generation process model; hidden Markov models; speech synthesis; Context modeling; Hidden Markov models; Optimization; Smoothing methods; Speech; Speech synthesis; Fo contour; HMM-based speech synthesis; generation process model;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing (ICSP), 2010 IEEE 10th International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-5897-4
Type
conf
DOI
10.1109/ICOSP.2010.5656358
Filename
5656358
Link To Document