DocumentCode
2838328
Title
Automatic detection of Chinese accent-index based on approximation-ratio
Author
Zhu, Weibin ; Zhang, Wei ; Shi, Qin ; Ma, Xijun ; Shen, Liqin
Author_Institution
IBM China Res. Lab, Beijing, China
fYear
2004
fDate
15-18 Dec. 2004
Firstpage
85
Lastpage
88
Abstract
To synthesize speech with better prosody, a TTS system should incorporate accent information. We therefore defined a set of accent indexes (AI) to represent the variation of accent in Chinese speech, and proposed a novel method to automatically annotate Chinese speech with the AI. The method used a parameter, named the approximation-ratio, to indicate numerically the accent of a prosodic unit; the value of the AI was the discretization of the approximation-ratio. One corpus was annotated with AI by this method, and a refined prosody parameter prediction model was built from that corpus. The experimental results showed that prosody parameters predicted by the refined model were closer to those of real speech than those predicted by the former model without AI. Further, a perceptual evaluation showed that the accent manifestation generated by the AI-ready synthesizer was distinguishable and acceptable.
Keywords
feature extraction; parameter estimation; speech intelligibility; speech synthesis; Chinese accent index; TTS system; approximation ratio; automatic annotation; automatic detection; perceptual evaluation; prosody parameter prediction model; speech distinguishability; Acoustic measurements; Artificial intelligence; Concatenated codes; Labeling; Predictive models; Speech analysis; Stress; Synthesizers; Text analysis
fLanguage
English
Publisher
IEEE
Conference_Titel
Chinese Spoken Language Processing, 2004 International Symposium on
Print_ISBN
0-7803-8678-7
Type
conf
DOI
10.1109/CHINSL.2004.1409592
Filename
1409592