DocumentCode
730767
Title
Coherent modification of pitch and energy for expressive prosody implantation
Author
Sorin, Alexander ; Shechtman, Slava ; Pollet, Vincent
Author_Institution
Speech Technol., IBM Res. - Haifa, Haifa, Israel
fYear
2015
fDate
19-24 April 2015
Firstpage
4914
Lastpage
4918
Abstract
In expressive TTS and voice transformation systems, implantation of expressive prosody derived from external out-of-domain sources often leads to extreme pitch modification that compromises the naturalness of the synthesized speech. In this work we investigate and prove a hypothesis that the naturalness loss is in part attributed to a violation of a fundamental relationship between the instantaneous pitch frequency and instantaneous energy of a speech signal. We propose an enhancement for pitch modification where the instantaneous energy is modified coherently with the pitch frequency and demonstrate the potential of this method in a subjective listening evaluation. The proposed approach is complementary to and can be combined with spectrum shape transformation methods for achieving the maximal possible quality of pitch modification.
Keywords
speech synthesis; voice equipment; energy coherent modification; expressive TTS; expressive prosody implantation; naturalness loss; pitch coherent modification; speech synthesis; voice transformation systems; Estimation; Harmonic analysis; Hidden Markov models; Noise; Prototypes; Shape; Speech; energy modification; energy modulation; expressive TTS; pitch modification; prosody modification;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178905
Filename
7178905
Link To Document