DocumentCode
2066965
Title
FlexVoice: a parametric approach to high-quality speech synthesis
Author
Balogh, Gyorgy ; Dobler, Ervin ; Gróbler, Tamás ; Smodies, B. ; Szepesvári, Csaba
Author_Institution
Mindmaker Ltd., Budapest, Hungary
fYear
2000
fDate
2000
Abstract
The TTS system described in this paper is based on the analysis and resynthesis of a given speaker´s voice. First, the speaker´s voice definition is prepared off-line: a diphone database is recorded, segmented, and analyzed in every 6 msec to obtain the filter parameters of an all-pole (AR) filter. During the on-line synthesis, the filters are excited with the mixture of a predefined periodic glottal source and white noise. Rigorous experiments have been made to find the parameter space in which the filter coefficients at diphone boundaries can effectively be smoothened. The best representation turned out to be the space of area ratios. Due to the smoothening and the carefully chosen corpus words, each diphone needs to be recorded only once thus no unit selection algorithm is needed. FlexVoice provides large flexibility in changing voice properties independently from the vocal tract parameters. This flexibility can be demonstrated by a number of voice conversions including female-to-male and female-to-child conversions. FlexVoice only uses a fraction of the resources of a PC and its quality is comparable to that of the leading TTS systems
Keywords
speech synthesis; FlexVoice; TTS system; all-pole filter; corpus words; diphone boundaries; diphone database; experiments; filter coefficients; filter parameters; high-quality speech synthesis; on-line synthesis; parameter space; parametric approach; periodic glottal source; space of area ratios; speaker voice definition; vocal tract parameters; voice analysis; voice conversation; voice conversions; voice properties; voice resynthesis; white noise;
fLanguage
English
Publisher
iet
Conference_Titel
State of the Art in Speech Synthesis (Ref. No. 2000/058), IEE Seminar on
Conference_Location
London
Type
conf
DOI
10.1049/ic:20000332
Filename
846972
Link To Document