DocumentCode :
2020205
Title :
A text-to-speech system for Spanish with a frequency domain based prosodic modification algorithm
Author :
Banga, E.R. ; Lopez-Gonzalo, E. ; Garcia-Mateo, C.
Author_Institution :
DTC-ETSI Telecomunicacion, Vigo Univ., Spain
Volume :
2
fYear :
1993
fDate :
27-30 April 1993
Firstpage :
183
Abstract :
From the input text, the linguistic-prosodic module obtains the phonetic transcription and prosodic marks that reflect both the syntactic structure and some rhythmical constraints. The synthesis module is a variation of the MBE (multiband excitation) vocoder with an LPC (linear predictive coding) filter that is very flexible for prosodic modifications. From a parametrized acoustic database, the algorithm decodes the speech units and modifies their prosody in a single process. The frequency-domain basis of the synthesis algorithm allows fine pitch modification without spectral envelope distortion. The prosody modeling is done using the acoustic module by a close-copy stylization method.
Keywords :
audio acoustics; frequency-domain synthesis; linear predictive coding; speech synthesis; vocoders; LPC; Spanish; acoustic database; close copy stylization; fine pitch modification; linear predictive coding; linguistic-prosodic module; multiband excitation; phonetic transcription; prosodic modification algorithm; rhythmical constraints; syntactic structure; text-to-speech system; vocoder; Acoustic distortion; Databases; Decoding; Frequency domain analysis; Frequency synthesizers; Linear predictive coding; Nonlinear filters; Speech processing; Speech synthesis; Vocoders;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location :
Minneapolis, MN, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.1993.319264
Filename :
319264