Title :
Incorporating dynamic features into minimum generation error training for HMM-based speech synthesis
Author :
Duy Khanh Ninh ; Morise, Masanori ; Yamashita, Yukihiko
Author_Institution :
Grad. Sch. of Sci. & Eng., Ritsumeikan Univ., Kusatsu, Japan
Abstract :
This paper describes new methods of minimum generation error (MGE) training in HMM-based speech synthesis by introducing the error component of dynamic features into the generation error function. We propose two methods for setting the weight associated with the additional error component. In fixed weighting approach, this weight is kept constant over the course of speech. In adaptive weighting approach, it is adjusted according to the degree of dynamic of speech segments. Objective evaluation shows that the newly derived MGE criterion with adaptive weighting method obtains comparable performance on static feature and better performance on delta feature compared to the baseline MGE criterion. Subjective evaluation exhibits an improvement in the quality of synthesized speech with the proposed technique. The newly derived criterion improves the capability of the HMMs in capturing dynamic properties of speech without increasing the computational complexity of training process compared to the baseline criterion.
Keywords :
computational complexity; hidden Markov models; speech synthesis; training; HMM-based speech synthesis; MGE training; baseline MGE criterion; baseline criterion; computational complexity; delta feature; dynamic features error component; dynamic features incorporation; generation error function; minimum generation error training; objective evaluation; speech course; speech dynamic properties; speech segment dynamics; speech synthesis quality; static feature; Heuristic algorithms; Hidden Markov models; Speech; Speech synthesis; Training; Training data; Vectors; HMM-based speech synthesis; dynamic features; minimum generation error training; spectral dynamics;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
DOI :
10.1109/ISCSLP.2012.6423486