DocumentCode :
3528939
Title :
Efficient gradient F0 tree model for prosody modeling and unit-selection, applied for the embedded US English concatenative TTS
Author :
Shechtman, Slava ; Tachibana, Ryuki
Author_Institution :
IBM Res., Haifa Res. Lab., Haifa
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4249
Lastpage :
4252
Abstract :
Modeling of pitch dynamics in addition to absolute pitch modeling is highly desirable for robust pitch curve prediction and unit selection in concatenative TTS systems. Transition prosody models have been reported to improve consistency and naturalness for pitch-accent and tonal languages, like Japanese and Mandarin. In the current work we revise a Gradient F0 tree model, originally developed for Japanese, and adjust it for American English. The resultant model requires few computational resources at runtime, which makes it highly suitable for embedded TTS applications. We report encouraging results of applying it to an embedded concatenative TTS system for American English.
Keywords :
gradient methods; natural language processing; speech synthesis; embedded US English concatenative TTS; gradient F0 tree model; pitch dynamics; robust pitch curve prediction; text-to-synthesis; unit selection; Decision support systems; Fiber reinforced plastics; Speech synthesis; Virtual reality; F0 modeling; embedded TTS; prosody modeling; speech synthesis; unit selection;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960567
Filename :
4960567