DocumentCode :
2703260
Title :
Database Mining for Flexible Concatenative Text-to-Speech
Author :
Eide, E.M. ; Fernandez, Raul
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
In this paper we explore mining a concatenative text-to-speech database to exploit subtle, naturally-occurring stylistic and contextual variability for runtime synthesis. By making a desired style or context known to the search during synthesis, the cost function can be biased toward finding units which satisfy these additional criteria. Having the ability to bias the output of the synthesizer towards a particular voice quality, or other characteristic such as speaking rate, increases its flexibility and potential value. In this paper we illustrate the approach to synthesizing subtle speech variation by focusing on three aspects: prosodic structure (phrase-finalness), prosodic prominence (prosodic accent), and voice quality (breathiness). Target values for the first two of these are automatically generated, while the target value for breathiness is specified by the user. We present results which indicate the value of distinguishing our data along these dimensions, and discuss possible improvements and new uses in the future.
Keywords :
audio databases; data mining; speech synthesis; contextual variability; database mining; flexible concatenative text-to-speech; naturally-occurring stylistic; phrase-finalness; prosodic accent; prosodic prominence; prosodic structure; voice quality; Character generation; Cost function; Data mining; Databases; Memory; Runtime; Signal processing; Speech analysis; Speech synthesis; Synthesizers; Speech synthesis; speech analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367008
Filename :
4218196
Link To Document :
بازگشت