Title : 
Frame correlation based autoregressive GMM method for voice conversion
         
        
            Author : 
Xian Li ; Zeng-Fu Wang
         
        
            Author_Institution : 
Dept. of Autom., Univ. of Sci. & Technol. of China, Hefei, China
         
        
        
        
        
        
            Abstract : 
In this paper, we present a frame correlation based autoregressive GMM method for voice conversion. In our system, the cross-frame correlation of the source feature is modeled with augmented delta features, and the cross-frame correlation of target feature is modeled by autoregressive models. The expectation maximization (EM) algorithm is used for the model training, and a maximum likelihood parameter conversion algorithm is then employed to convert the feature of a source speaker into the one of a target speaker frame by frame. This method is consistent in training and conversion by using target feature´s cross-frame correlation explicitly at both stage. The experimental results show that the proposed method has excellent performance. The test set log probability of it is higher than the GMM-DYN (GMM with dynamic features) method, and the subjective evaluation results of it are also comparable to the GMM-DYN method. Furthermore, it is much more suitable for low-latency application.
         
        
            Keywords : 
Gaussian processes; autoregressive processes; expectation-maximisation algorithm; mixture models; speech synthesis; EM algorithm; augmented delta features; cross-frame correlation; expectation maximization algorithm; frame correlation based autoregressive GMM method; maximum likelihood parameter conversion algorithm; model training; source feature; source speaker feature; target feature cross-frame correlation; target speaker; voice conversion; Correlation; Hidden Markov models; Maximum likelihood estimation; Speech; Training; Trajectory; Vectors; speech synthesis; voice conversion;
         
        
        
        
            Conference_Titel : 
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
         
        
            Conference_Location : 
Singapore
         
        
        
            DOI : 
10.1109/ISCSLP.2014.6936610