Title :
Speech synthesis system based on a variable decimation/interpolation factor
Author :
De los Galanes, F. M. Giménez ; Savoji, M.H. ; Pardo, J.M.
Author_Institution :
ETSI Telecomunicacion, Univ. Politecnica de Madrid, Spain
Abstract :
In this paper we present a modification of the usual decimation-interpolation steps for resampling speech signals that is especially suited to arbitrary modification of the fundamental frequency and duration of speech segments. The modification is intended to overcome the time- and frequency-domain limitations that such a resampling scheme imposes, so that it can be used in a speech synthesis system. For prosody modification, this resampling method performs better than the equivalent PSOLA (Pitch-Synchronous Overlap-Add) method when working at a sampling frequency of 8 to 10 kHz, so that the source spectrum of the voiced allophones can be said to be completely harmonic. An optimization of the proposed algorithm that allows a real-time implementation is also discussed.
Keywords :
interpolation; optimisation; signal sampling; speech synthesis; algorithm; frequency domain limitation; fundamental frequency; interpolation factor; optimization; performance; prosody modification; real time implementation; resampling; source spectrum; speech segments duration; speech signals; speech synthesis system; time domain limitation; variable decimation; voiced allophones; Databases; Frequency domain analysis; Interpolation; Linear predictive coding; Phase change materials; Sampling methods; Signal analysis; Speech analysis; Speech synthesis; Telecommunications;
Conference_Title :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479678