DocumentCode :
394333
Title :
Syllable clustering and spectral discontinuity in syllable-based TTS systems
Author :
Chen, Fangxin
Volume :
1
fYear :
2003
fDate :
6-10 April 2003
Abstract :
The paper examines the spectral discontinuity problem in syllable-based Chinese text-to-speech (TTS) systems. Acoustic and phonetic investigations showed that, in natural speech, syllables with an approximant, nasal, or vowel onset tend to form syllable clusters with their preceding syllables because of strong coarticulation. In speech synthesis, these syllable clusters are the major source of audible spectral discontinuity. The implication of this finding for improving the voice quality of syllable-based TTS systems is discussed.
Keywords :
natural languages; pattern clustering; speech; speech synthesis; Chinese TTS systems; acoustic investigations; coarticulation effect; natural speech; phonetic investigations; spectral discontinuity; speech synthesis; syllable clustering; text-to-speech systems; voice quality; Acoustic noise; Clustering algorithms; Energy states; Laboratories; Loudspeakers; Natural languages; Spectrogram; Speech synthesis;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1198874
Filename :
1198874