DocumentCode
417103
Title
Voice characteristics conversion for TTS using reverse VTLN
Author
Eichner, Matthias ; Wolff, Matthias ; Hoffmann, Rüdiger
Author_Institution
Dresden Univ. of Technol., Germany
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
In the past, several approaches have been proposed for voice conversion in TTS systems. Mostly, conversion is done by modifying the spectral properties and pitch to match a certain target voice. This conversion causes distortions that degrade the quality of the synthesized speech. In this paper we investigate a very simple and straightforward method for voice conversion. It generates a new voice from the source speaker instead of reproducing a certain target speaker's voice. For application in TTS systems it is often sufficient to synthesize new voices that sound different enough to be distinguishable from each other. This is done by applying a spectral warping technique called vocal tract length normalization (VTLN), which is commonly used for speaker normalization in speech recognition systems. Because of its low resource requirements, this method is especially suited to embedded systems.
Keywords
embedded systems; spectral analysis; speech recognition; speech synthesis; TTS; reverse VTLN; source speaker; speaker normalization; spectral warping technique; speech recognition systems; vocal tract length normalization; voice characteristics conversion; Acoustic distortion; Character recognition; Databases; Embedded system; Loudspeakers; Signal processing; Signal synthesis; Speech recognition; Speech synthesis; Synthesizers
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1325911
Filename
1325911