Title :
Residual-based speech modification algorithms for text-to-speech synthesis
Author :
Edgington, M. ; Lowry, A.
Author_Institution :
British Telecom Res. Labs., Ipswich, UK
Abstract :
The paper presents a set of novel algorithms for the signal modification component of concatenative text-to-speech systems. The algorithms described are based around the LPC analysis/synthesis framework, and achieve prosodic modification by time-domain processing of the LPC residual. The modified residual is then recombined with the all-pole spectral estimate to synthesise the new speech signal. The methods differ in the processing applied to the residual signal. The first method uses a modified version of TD-PSOLA, relying on assumptions of decorrelation and spectral flatness to avoid spectral distortion. The second method uses multiple windowing within each pitch period, enabling a given pitch modification to be realised by shifting several windowed segments by small amounts rather than a large shift of a single window. Again the aim is to reduce phase distortion introduced by the time-shifting process. The third method is based on a smoothly varying resampling of the residual, rather than windowed overlap-add. TD-PSOLA and the residual-based methods were subject a informal listening tests both with pitch and time-scaled natural speech, and also integrated into the signal processing stage of the BT Laureate text-to-speech system
Keywords :
concatenated codes; signal sampling; spectral analysis; speech synthesis; time-domain analysis; BT Laureate text-to-speech system; LPC analysis/synthesis framework; all-pole spectral estimate; concatenative text-to-speech systems; decorrelation; informal listening tests; multiple windowing; phase distortion; pitch period; prosodic modification; residual-based speech modification algorithms; signal modification component; smoothly varying resampling; spectral flatness; speech signal synthesis; text-to-speech synthesis; time-domain processing; time-scaled natural speech; time-shifting process; windowed segments; Algorithm design and analysis; Decorrelation; Linear predictive coding; Phase distortion; Signal processing; Signal processing algorithms; Signal synthesis; Speech synthesis; System testing; Time domain analysis;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607882