DocumentCode :
24851
Title :
A Parameter Generation Algorithm Using Local Variance for HMM-Based Speech Synthesis
Author :
Nose, Takashi ; Chunwijitra, Vataya ; Kobayashi, Takehiko
Author_Institution :
Dept. of Inf. Process., Tokyo Inst. of Technol., Yokohama, Japan
Volume :
8
Issue :
2
fYear :
2014
fDate :
Apr-14
Firstpage :
221
Lastpage :
228
Abstract :
This paper proposes a parameter generation algorithm using a local variance (LV) model in HMM-based speech synthesis. In the proposed technique, we define the LV as a feature that represents the local variation of a spectral parameter sequence and model LVs using HMMs. Context-dependent HMMs are used to capture the dependence of LV trajectories on phonetic and prosodic contexts. In addition, the dynamic features of LVs are taken into account as well as the static one to appropriately model the dynamic characteristics of LV trajectories. By introducing the LV model into the spectral parameter generation process, the proposed technique can impose a more precise variance constraint for each frame than the conventional technique with a global variance (GV) model. Consequently, the proposed technique alleviates the excessive spectral peak enhancement that often occurs in GV-based parameter generation. Objective evaluation results show that the proposed technique can generate better spectral parameter trajectories than the GV-based technique in terms of spectral and LV distortion. Moreover, the results of subjective evaluation demonstrate that the proposed technique can generate synthetic speech significantly closer to the original one than the conventional technique while maintaining speech naturalness.
Keywords :
hidden Markov models; speech synthesis; GV model; HMM based speech synthesis; LV model; context dependent HMM; global variance; local variance model; parameter generation algorithm; phonetic contexts; prosodic contexts; spectral parameter generation process; spectral parameter sequence; spectral peak enhancement; Context modeling; Correlation; Hidden Markov models; Speech; Training; Trajectory; Vectors; HMM-based speech synthesis; local variance; over-smoothing problem; spectral parameter generation;
fLanguage :
English
Journal_Title :
Selected Topics in Signal Processing, IEEE Journal of
Publisher :
ieee
ISSN :
1932-4553
Type :
jour
DOI :
10.1109/JSTSP.2013.2283459
Filename :
6609040
Link To Document :
بازگشت