Title :
Context features based pre-selection and weight prediction in concatenation speech synthesis system
Author :
Shanfeng Liu ; Zhengqi Wen ; Ya Li ; Jianghua Tao ; Bin Liu
Author_Institution :
Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
Abstract :
How to generate natural-sounding synthesized speech has been challenging all the researchers in speech synthesis area. Experiments show that speech concatenated by units selected from large speech corpus has a better performance. However how to limit the searching space and predict weights when calculating target cost is an important problem. This paper presents a detailed hierarchical pre-selection method to limit the searching of space. After three layers of pre-selection, a set of units are selected as the candidate units. In order to ensure the continuity in the duration, the prediction model is used in the hierarchical pre-selection. Meanwhile, M5P algorithm which is combined with decision tree and regression is presented in this paper to predict weights needed in target cost calculation. Experimental result shows that these two approaches can generate high quality speech.
Keywords :
decision trees; regression analysis; speech synthesis; text analysis; M5P algorithm; concatenation speech synthesis system; context feature based preselection; decision tree; hierarchical pre-selection method; high quality speech; natural-sounding synthesized speech; regression analysis; searching space; speech corpus; weight prediction; Acoustics; Context; Databases; Prediction algorithms; Predictive models; Speech; Speech synthesis; concatenation speech synthesis; hierarchical pre-selection; weight prediction;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location :
Singapore
DOI :
10.1109/ISCSLP.2014.6936611