Title :
Improved time-frequency trajectory excitation modeling for a statistical parametric speech synthesis system
Author :
Eunwoo Song ; Young-Sun Joo ; Hong-Goo Kang
Author_Institution :
Department of Electrical and Electronic Engineering, Yonsei University, Seoul, South Korea
Abstract :
This paper proposes an improved time-frequency trajectory excitation (TFTE) modeling method for a statistical parametric speech synthesis system. The proposed approach overcomes the dimensional variation problem in the training process, which arises from the pitch-dependent analysis paradigm. By reducing parameter redundancy using predicted average block coefficients (PABC), the proposed algorithm models the excitation efficiently even when its dimension varies. Objective and subjective test results verify that the proposed algorithm improves both the robustness of the training process and the naturalness of the synthesized speech.
Keywords :
speech synthesis; statistical analysis; time-frequency analysis; PABC; TFTE modeling; dimensional variation problem; naturalness; pitch-dependent analysis paradigm; statistical parametric speech synthesis system; time-frequency trajectory excitation modeling; training process; Algorithm design and analysis; Hidden Markov models; Speech; Training; predicted average block coefficient (PABC); slowly evolving waveform (SEW); time-frequency trajectory excitation (TFTE)
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178912