DocumentCode :
3164523
Title :
Punctuation generation inspired linguistic features for mandarin prosodic boundary prediction
Author :
Chiang, Chen-Yu ; Wang, Yih-Ru ; Chen, Sin-Horng
Author_Institution :
Inst. of Commun. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4597
Lastpage :
4600
Abstract :
A novel statistical linguistic feature, called punctuation confidence, is proposed in this paper for assisting in prosodic break prediction in Mandarin text-to-speech. The punctuation confidence calculated from the input text is a measure of the likelihood of inserting a major PM at a word boundary. Since a punctuation in text tends to be pronounced as a break, the punctuation confidence associated with a punctuation estimate should provide useful information for break prediction from text. The idea is realized in this study by first employing a conditional random field (CRF)-based model to generate a predicted punctuation and its associated punctuation confidence for each word boundary. Then, the predicted punctuation and its punctuation confidence are combined with contextual linguistic features to predict the break type of the word boundary by an MLP (multi-layer perceptrons). Experiment on the Treebank speech corpus confirmed the effectiveness of the proposed approach.
Keywords :
linguistics; multilayer perceptrons; speech synthesis; statistical analysis; Mandarin prosodic boundary prediction; Mandarin text-to-speech; Treebank speech corpus; break type; conditional random field-based model; contextual linguistic features; multilayer perceptrons; predicted punctuation; prosodic break prediction; punctuation confidence; punctuation estimate; punctuation generation inspired linguistic features; statistical linguistic feature; word boundary; Context; Generators; Labeling; Pragmatics; Predictive models; Speech; Syntactics; conditional random field; prosodic break; punctuation confidence; punctuation generation; text-to-speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288942
Filename :
6288942
Link To Document :
بازگشت