DocumentCode
3528967
Title
Control of prosodic focus in corpus-based generation of fundamental frequency contours of Japanese based on the generation process model
Author
Ochi, Kiyoshi ; Hirose, Keikichi ; Minematsu, Nobuaki
Author_Institution
Dept. of Inf. & Commun. Eng., Univ. of Tokyo, Tokyo
fYear
2009
fDate
19-24 April 2009
Firstpage
4257
Lastpage
4260
Abstract
A fully corpus-based process for generating prosodic features from text is developed. The process first predicts pauses and phone durations, and then generates F0 contours. Since F0 contour generation is based on the generation process model, it is rather easy to manipulate the generated F0 contours at the command level. A method was developed for generating sentence F0 contours when a focus is placed on one of the "bunsetsu" of an utterance. The method predicts the differences in the F0 model commands between utterances with and without focus, and applies them to the F0 model commands predicted beforehand by the baseline method. The validity of the method was demonstrated by an experiment on F0 contour generation and speech synthesis.
Keywords
natural language processing; speech synthesis; Japanese; bunsetsu; corpus-based process; fundamental frequency contour generation; generation process model; prosodic feature generation; prosodic focus control; Automatic control; Communication system control; Frequency; Hidden Markov models; Information systems; Predictive models; Speech analysis; Testing; Training data; Corpus-based method; F0 contour; Prosodic focus
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960569
Filename
4960569