Title :
Using a pitch-synchronous residual codebook for hybrid HMM/frame selection speech synthesis
Author :
Drugman, Thomas ; Moinet, Alexis ; Dutoit, Thierry ; Wilfart, Geoffrey
Author_Institution :
TCTS Lab., Fac. Polytech. de Mons, Mons
Abstract :
This paper proposes a method to improve the quality delivered by statistical parametric speech synthesizers. For this, we use a codebook of pitch-synchronous residual frames, so as to construct a more realistic source signal. First a limited codebook of typical excitations is built from some training database. During the synthesis part, HMMs are used to generate filter and source coefficients. The latter coefficients contain both the pitch and a compact representation of target residual frames. The source signal is obtained by concatenating excitation frames picked up from the codebook, based on a selection criterion and taking target residual coefficients as input. Subjective results show a relevant improvement compared to the basic technique.
Keywords :
hidden Markov models; speech coding; speech synthesis; frame selection speech synthesis; hidden Markov model; hybrid HMM; pitch-synchronous residual codebook; source signal construction; statistical parametric speech synthesizer; Cepstral analysis; Databases; Filtering; Filters; Hidden Markov models; Signal processing; Signal synthesis; Speech processing; Speech synthesis; Synthesizers; HMM-based Speech Synthesis; Hybrid Synthesis; Residual Modeling;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960453