Title :
Hybrid time- and frequency-domain speech synthesis with extended glottal source generation
Author_Institution :
Forschungs und Technol., Deutsche Bundespost Telekom, Darmstadt, Germany
Abstract :
A novel synthesis approach that combines speech synthesis in the time domain with speech synthesis in the frequency domain is introduced. The intention is to improve speech quality by designing a hybrid system which profits from the advantages of both methods and overcomes some of their drawbacks. Compared to a stand-alone formant synthesizer, a better quality of fricatives and plosives has been achieved, whereas the flexibility in fundamental frequency variation is preserved. Moreover, simultaneous use of both system components enables the system to produce naturally sounding transitions at the segment boundaries. The parametric part of the hybrid system-a formant-based synthesizer-is excited with a time-domain source generation scheme. It is based on concatenation and modification of stored natural source waveforms. Important system features are phoneme-specific variants of stored source waveforms and additional generation of shimmer and jitter. Preliminary informal listening tests showed that the naturalness of the voiced sounds has been improved compared to the results of the previous synthesizer
Keywords :
frequency-domain synthesis; speech synthesis; time-domain synthesis; concatenation; extended glottal source generation; formant-based synthesizer; frequency-domain speech synthesis; fricatives; fundamental frequency variation; hybrid system; informal listening tests; jitter; naturally sounding transitions; phoneme-specific variants; plosives; segment boundaries; shimmer; speech quality; stored natural source waveforms; stored source waveforms; time-domain source generation; time-domain speech synthesis; voiced sounds; Acoustic testing; Control system synthesis; Frequency domain analysis; Frequency synthesizers; Hybrid power systems; Jitter; Signal synthesis; Speech synthesis; System performance; Time domain analysis;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-1775-0
DOI :
10.1109/ICASSP.1994.389227