Title :
Hierarchical prosodic boundary prediction for Uyghur TTS
Author :
Hamdulla, Askar ; Mamateli, Guljamal ; Rozi, A. ; Imam, Seyyare
Author_Institution :
Sch. of Software, Xinjiang Univ., Urumqi, China
Abstract :
Correct prosodic boundary prediction is crucial for the quality of synthesized speech. This paper presents the prosodic hierarchy of the Uyghur language, which is an agglutinative language. A two-layer bottom-up hierarchical approach based on conditional random fields (CRF) is used to predict prosodic word (PW) and prosodic phrase (PP) boundaries. To resolve the confusion between different prosodic boundaries at punctuation sites, a CRF-based prosodic boundary determination model is integrated with the bottom-up hierarchical approach. Word suffix features are considered useful for prosodic boundary prediction and are added to the feature sets. The experimental results show that the proposed method successfully resolves the confusion between different prosodic boundaries and consequently further improves the accuracy of prosodic boundary prediction.
Keywords :
natural language processing; speech synthesis; CRF based prosodic boundary determination model; PP boundary; PW boundary; Uyghur TTS; Uyghur-language; agglutinative language; conditional random fields; correct prosodic boundary prediction; hierarchical prosodic boundary prediction; prosodic phrase boundary; prosodic word prediction; synthesized speech quality; text-to-speech synthesis; two-layer bottom-up hierarchical approach; Educational institutions; Hidden Markov models; Indium phosphide; Predictive models; Speech; Syntactics;
Conference_Title :
Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
Conference_Location :
Hollywood, CA
Print_ISBN :
978-1-4673-4863-8