Title : 
Speech prosody control using weighted neural network ensembles
         
        
            Author : 
Romsdorfer, Harald
         
        
            Author_Institution : 
Speech Process. Group, ETH Zurich, Zurich, Switzerland
         
        
        
        
        
        
            Abstract : 
Ensembles of artificial neural networks (ANNs) show improved generalization capabilities that outperform those of single networks. However, for aggregation to be effective, the individual networks must be as accurate and diverse as possible. This paper presents a new statistical model for prosody control that combines weighted ensembles of ANNs with feature relevance determination. This approach allows the individual networks to be accurate and diverse. The weighted neural network ensemble model was applied for both, phone duration modeling and fundamental frequency modeling. A comparison with state-of-the-art prosody models based on classification and regression trees (CART), multivariate adaptive regression splines (MARS), or ANN, shows a 12% improvement compared to the best duration model and a 24% improvement compared to the best F0 model. The neural network ensemble model also outperforms another, recently presented ensemble model based on gradient tree boosting.
         
        
            Keywords : 
gradient methods; neural nets; regression analysis; speech synthesis; splines (mathematics); trees (mathematics); artificial neural networks; classification trees; feature relevance determination; fundamental frequency modeling; gradient tree boosting; multivariate adaptive regression splines; phone duration modeling; regression trees; speech prosody control; speech synthesis; statistical model; weighted neural network ensembles; Artificial neural networks; Feedforward systems; Frequency; Neural networks; Predictive models; Regression tree analysis; Speech processing; Speech synthesis; Training data; Weight control; ensemble models; neural networks; prosody control; speech synthesis;
         
        
        
        
            Conference_Titel : 
Machine Learning for Signal Processing, 2009. MLSP 2009. IEEE International Workshop on
         
        
            Conference_Location : 
Grenoble
         
        
            Print_ISBN : 
978-1-4244-4947-7
         
        
            Electronic_ISBN : 
978-1-4244-4948-4
         
        
        
            DOI : 
10.1109/MLSP.2009.5306247