• DocumentCode
    2066965
  • Title

    FlexVoice: a parametric approach to high-quality speech synthesis

  • Author

    Balogh, Gyorgy ; Dobler, Ervin ; Gróbler, Tamás ; Smodies, B. ; Szepesvári, Csaba

  • Author_Institution
    Mindmaker Ltd., Budapest, Hungary
  • fYear
    2000
  • fDate
    2000
  • Abstract
    The TTS system described in this paper is based on the analysis and resynthesis of a given speaker's voice. First, the speaker's voice definition is prepared off-line: a diphone database is recorded, segmented, and analyzed every 6 ms to obtain the parameters of an all-pole (AR) filter. During on-line synthesis, the filters are excited with a mixture of a predefined periodic glottal source and white noise. Rigorous experiments were carried out to find the parameter space in which the filter coefficients at diphone boundaries can be smoothed effectively. The best representation turned out to be the space of area ratios. Because of this smoothing and the carefully chosen corpus words, each diphone needs to be recorded only once, so no unit selection algorithm is needed. FlexVoice provides great flexibility in changing voice properties independently of the vocal tract parameters. This flexibility is demonstrated by a number of voice conversions, including female-to-male and female-to-child conversions. FlexVoice uses only a fraction of the resources of a PC, and its quality is comparable to that of the leading TTS systems.
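    The abstract gives no formulas, so the following is only a minimal sketch of what smoothing filter coefficients "in the space of area ratios" could look like, not the authors' implementation. It assumes the textbook lossless-tube relation r = (1 - k) / (1 + k) between a reflection coefficient k and an area ratio r (sign conventions differ between texts), a predictor polynomial A(z) = 1 + a1 z^-1 + ... + ap z^-p, and plain linear interpolation across the boundary. All function names are hypothetical.

```python
def lpc_to_reflection(a):
    """Step-down recursion: predictor coefficients [a1..ap] -> reflection coefficients [k1..kp]."""
    cur = list(a)
    k = [0.0] * len(cur)
    for m in range(len(cur), 0, -1):
        km = cur[m - 1]
        if abs(km) >= 1.0:
            raise ValueError("filter is not stable")
        k[m - 1] = km
        denom = 1.0 - km * km
        # Reduce the order-m polynomial to order m-1.
        cur = [(cur[i] - km * cur[m - 2 - i]) / denom for i in range(m - 1)]
    return k


def reflection_to_lpc(k):
    """Step-up recursion: reflection coefficients [k1..kp] -> predictor coefficients [a1..ap]."""
    a = []
    for m, km in enumerate(k, start=1):
        a = [a[i] + km * a[m - 2 - i] for i in range(m - 1)] + [km]
    return a


def to_area_ratios(k):
    # Assumed convention: r = (1 - k) / (1 + k); r > 0 whenever |k| < 1.
    return [(1.0 - ki) / (1.0 + ki) for ki in k]


def from_area_ratios(r):
    return [(1.0 - ri) / (1.0 + ri) for ri in r]


def smooth_boundary(a_left, a_right, steps):
    """Interpolate linearly between two frames in area-ratio space.

    Returns steps + 1 frames of predictor coefficients, from a_left to a_right.
    """
    r0 = to_area_ratios(lpc_to_reflection(a_left))
    r1 = to_area_ratios(lpc_to_reflection(a_right))
    frames = []
    for s in range(steps + 1):
        t = s / steps
        r = [(1.0 - t) * x + t * y for x, y in zip(r0, r1)]
        frames.append(reflection_to_lpc(from_area_ratios(r)))
    return frames
```

    Under these assumptions, a blend of two positive area ratios stays positive, so every intermediate frame maps back to reflection coefficients strictly inside (-1, 1) and the interpolated all-pole filter remains stable. Interpolating the raw predictor coefficients directly carries no such guarantee.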
  • Keywords
    speech synthesis; FlexVoice; TTS system; all-pole filter; corpus words; diphone boundaries; diphone database; experiments; filter coefficients; filter parameters; high-quality speech synthesis; on-line synthesis; parameter space; parametric approach; periodic glottal source; space of area ratios; speaker voice definition; vocal tract parameters; voice analysis; voice conversation; voice conversions; voice properties; voice resynthesis; white noise
  • fLanguage
    English
  • Publisher
    IET
  • Conference_Titel
    State of the Art in Speech Synthesis (Ref. No. 2000/058), IEE Seminar on
  • Conference_Location
    London
  • Type

    conf

  • DOI
    10.1049/ic:20000332
  • Filename
    846972