DocumentCode
394333
Title
Syllable clustering and spectral discontinuity in syllable-based TTS systems
Author
Chen, Fangxin
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
The paper examines the spectral discontinuity problem in syllable-based Chinese TTS (text-to-speech) systems. Acoustic and phonetic investigations showed that, in natural speech, syllables whose onset is an approximant, nasal, or vowel tend to form syllable clusters with the preceding syllable because of strong coarticulation. In speech synthesis, these syllable clusters are the major source of audible spectral discontinuity. The implications of this finding for improving the voice quality of syllable-based TTS are discussed.
Keywords
natural languages; pattern clustering; speech; speech synthesis; Chinese TTS systems; acoustic investigations; coarticulation effect; natural speech; phonetic investigations; spectral discontinuity; syllable clustering; text-to-speech systems; voice quality; Acoustic noise; Clustering algorithms; Energy states; Laboratories; Loudspeakers; Spectrogram
fLanguage
English
Publisher
IEEE
Conference_Titel
Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '03)
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198874
Filename
1198874