Title :
Segmenting unrestricted Chinese text into prosodic words instead of lexical words
Author :
Qian, Yao ; Chu, Min ; Peng, Hu
Author_Institution :
Shanghai Normal Univ., China
Abstract :
This paper stresses the importance of converting a string of lexical words into a string of prosodic words in text-to-speech (TTS) systems by presenting the surface and perceptual differences between the two. A statistical rule-based method and a classification and regression tree (CART) based method are proposed as solutions. Although the ComplicatedSet-based CART method performs best, this comes at the cost of the heavy computational workload required by a parser. The statistical rule-based method yields higher recall but lower precision than the SimpleSet CART method. It is difficult to tell which of these two is better, since we do not know whether precision or recall affects naturalness more. Both require only lexicon word segmentation and part-of-speech (POS) tagging in the preprocessing stage and are easily implemented in TTS systems. Results of the preference test show that significant improvements in naturalness are perceived when lexical word strings are converted into prosodic word strings by our approach.
Keywords :
speech synthesis; statistical analysis; CART based method; ComplicatedSet based CART method; POS tagging; SimpleSet CART method; TTS systems; classification and regression tree; lexical word strings; lexicon word segmentation; part-of-speech tagging; perceptual differences; preference test; prosodic word strings; statistical rule based method; surface differences; text-to-speech systems; unrestricted Chinese text; Asia; Computational efficiency; Natural languages; Rhythm; Speech synthesis; Stress; Tagging; Testing; Text analysis; Zirconium;
Conference_Title :
2001 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '01), Proceedings
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.941042