DocumentCode :
3341514
Title :
Spectral voice conversion for text-to-speech synthesis
Author :
Kain, Alexander ; Macon, Michael W.
Author_Institution :
CSLU, Oregon Graduate Inst. of Sci. & Technol., Beaverton, OR, USA
Volume :
1
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
285
Abstract :
A new voice conversion algorithm that modifies a source speaker's speech to sound as if produced by a target speaker is presented. It is applied to a residual-excited LPC text-to-speech diphone synthesizer. Spectral parameters are mapped using a locally linear transformation based on Gaussian mixture models whose parameters are trained by joint density estimation. The LPC residuals are adjusted to match the target speaker's average pitch. To study effects of the amount of training on performance, data sets of varying sizes are created by automatically selecting subsets of all available diphones by a vector quantization method. In an objective evaluation, the proposed method is found to perform more reliably for small training sets than a previous approach. In perceptual tests, it was shown that nearly optimal spectral conversion performance was achieved, even with a small amount of training data. However, speech quality improved with increases in the training set size.
Keywords :
Gaussian processes; linear predictive coding; parameter estimation; spectral analysis; speech synthesis; vector quantisation; Gaussian mixture models; LPC residuals; diphone synthesizer; joint density estimation; locally linear transformation; objective evaluation; perceptual tests; residual-excited LPC; spectral parameters; spectral voice conversion; speech quality; target speaker's average pitch; text-to-speech synthesis; training sets; vector quantization; Linear predictive coding; Loudspeakers; Natural languages; Performance evaluation; Piecewise linear techniques; Speech synthesis; Synthesizers; Testing; Training data; Vector quantization;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.674423
Filename :
674423