DocumentCode
2067254
Title
Automatic Prosody Boundary Labeling of Mandarin Using Both Text and Acoustic Information
Author
Ni, Chongjia ; Liu, Wenju ; Xu, Bo
fYear
2008
fDate
16-19 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
Prosody is an important factor for a high quality text-to- speech (TTS) system. Prosody is often described with a hierarchical structure. So the generation of the hierarchical prosody structure is very important both in the corpus building and the real-time text analysis, but the prosody labeling procedure is laborious and time consuming. In this paper, an automatic prosody boundary label system is presented, in which the classification and regression tree (CART) framework is used. In this system, we build a prosody model using acoustic information and the text information based on large speech corpus with prosodic structure label (ASCCD). Experiments show this model can achieve prosody boundary detection 90.86% accuracy.
Keywords
natural language processing; regression analysis; speech synthesis; trees (mathematics); Mandarin; acoustic information; automatic prosody boundary labeling; classification and regression tree framework; high quality text-to-speech system; prosodic structure label; real-time text analysis; speech corpus; text information; Automatic speech recognition; Buildings; Decision trees; Hidden Markov models; Labeling; Laboratories; Loudspeakers; Natural languages; Pattern recognition; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
Conference_Location
Kunming
Print_ISBN
978-1-4244-2942-4
Electronic_ISBN
978-1-4244-2943-1
Type
conf
DOI
10.1109/CHINSL.2008.ECP.100
Filename
4730354
Link To Document