DocumentCode :
2501270
Title :
F0 analysis for Japanese conversational speech synthesis
Author :
Nakajima, Hideharu ; Sagisaka, Yoshinori
Author_Institution :
NTT Cyber Space Labs., NTT Corp., Yokosuka, Japan
fYear :
2009
fDate :
20-22 Oct. 2009
Firstpage :
137
Lastpage :
142
Abstract :
This paper proposes a conversational-style text-to-speech synthesis scheme based on an analysis of fundamental frequency (F0). Through the analysis, we confirm that conversational F0 can be represented by the superpositional model using three components spanning the utterance, major phrase, and minor phrase. We compare each component of the model between conversational style and reading style to investigate the following points: where large F0 discrepancies are found, which linguistic factors are related to the discrepancies, and to what extent such discrepancies occur. This paper uses real domain data that includes a large amount of linguistic context. The analysis confirms that large differences occur in global components, such as those spanning whole utterances and phrases, and that the differences occur at or around domain-specific expressions. The analysis also reveals that local components are almost the same in both styles. These analyses show that, to realize conversational synthesis in the superpositional manner, it is necessary to estimate the utterance and phrase components from word attributes other than grammatical clues.
Keywords :
linguistics; natural languages; speech synthesis; F0 analysis; Japanese speech synthesis; conversational style text-speech synthesis scheme; domain-specific expression; fundamental frequency analysis; linguistic factor; speech utterance; superpositional model; Frequency conversion; Frequency synthesizers; Information analysis; Natural language processing; Pattern analysis; Speech analysis; Speech synthesis;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Natural Language Processing, 2009. SNLP '09. Eighth International Symposium on
Conference_Location :
Bangkok
Print_ISBN :
978-1-4244-4138-9
Electronic_ISBN :
978-1-4244-4139-6
Type :
conf
DOI :
10.1109/SNLP.2009.5340932
Filename :
5340932