DocumentCode :
2551850
Title :
An overview of text-to-speech synthesis
Author :
Acero, Alex
Author_Institution :
Speech Technol. Group, Microsoft Corp., Redmond, WA, USA
fYear :
2000
fDate :
2000
Firstpage :
1
Abstract :
Summary form only given. The article gives an overview of text-to-speech (TTS) technology and a description of some issues of potential interest to speech coding experts. After motivation for the use of TTS technology, it describes the general architecture of a text-to-speech system with particular emphasis on the speech synthesis component. Both formant synthesis and concatenative synthesis are presented, offering different degrees of flexibility and quality. Several well-known speech coding techniques (including LPC vocoders, waveform interpolation, harmonic coding, and layered coding) have been used in speech synthesis. It explains how they have been applied, and the advantages and limitations of those techniques when used in speech synthesis. The main goal is to increase cooperation between the speech coding community and the TTS community, and in particular to motivate the need for speech coding algorithms that meet the requirements of the next generation speech synthesis technology
Keywords :
interpolation; linear predictive coding; speech coding; speech synthesis; vocoders; LPC vocoders; concatenative synthesis; formant synthesis; harmonic coding; layered coding; speech coding algorithms; speech quality; speech synthesis; speech synthesis technology; text-to-speech synthesis; text-to-speech system architecture; text-to-speech technology; waveform interpolation; Interpolation; Linear predictive coding; Speech coding; Speech synthesis; Vocoders;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speech Coding, 2000. Proceedings. 2000 IEEE Workshop on
Conference_Location :
Delavan, WI
Print_ISBN :
0-7803-6416-3
Type :
conf
DOI :
10.1109/SCFT.2000.878372
Filename :
878372
Link To Document :
بازگشت