DocumentCode :
323794
Title :
Source-filter models for time-scale pitch-scale modification of speech
Author :
Acero, Alex
Author_Institution :
Microsoft Corp., Redmond, WA, USA
Volume :
2
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
881
Abstract :
This paper presents two time-scale pitch-scale modification techniques to be used in speech synthesis systems. They have been applied to Microsoft´s Whistler system, which is based on concatenative synthesis. Both methods are based on a source-filter model, one of them using LPC parameters and the other one using cepstral parameters. The proposed methods achieve high quality prosody modification, retain the characteristics of the donor speaker, allow for spectral manipulation (to reduce spectral discontinuities at unit boundaries), yield compact acoustic inventories and improved voiced fricatives
Keywords :
FIR filters; feature extraction; filtering theory; linear predictive coding; parameter estimation; spectral analysis; speech coding; speech synthesis; FIR filter; LPC parameters; Microsoft´s Whistler system; cepstral parameters; compact acoustic inventories; concatenative synthesis; donor speaker characteristics; epoch extraction; high quality prosody modification; source-filter models; spectral discontinuities reduction; spectral manipulation; speech synthesis systems; time-scale pitch-scale modification; unit boundaries; voiced fricatives; Cepstral analysis; Filters; Linear predictive coding; Loudspeakers; Man machine systems; Pulse generation; Smoothing methods; Speech processing; Speech synthesis; Synthesizers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.675406
Filename :
675406
Link To Document :
بازگشت