Title :
High Quality Voice Conversion through Combining Modified GMM and Formant Mapping for Mandarin
Author :
Kun Liu ; Jianping Zhang ; Yonghong Yan
Author_Institution :
Chinese Acad. of Sci. (CAS), Beijing
Abstract :
A novel voice conversion system using formant mapping based on modified GMM technique is proposed in this paper. Compared with the traditional GMM technique, our modified GMM technique selects the stable frames automatically in each vowel phoneme for parameter extraction to avoid using the parameters in the transition part. With the spectral parameters extracted from the stable frames, phoneme-based GMM model is built. In the transformation procedure, the transformed formant frequencies at the main vowel phonemes are obtained using the corresponding GMM model. After this, the spectrum of the test speech is warped in frequency axis using warping functions determined with the transformed formant frequencies and original ones. Besides, fundamental frequencies, average energy and speaking rate are also transformed. Finally the converted speech is re-synthesized with the transformed parameters by high quality speech manipulation framework STRAIGHT. Perceptual results for F-M and M-F conversion show that our MOS score of the converted voice is improved from 3.8 to 4.1 and ABX score from 3.3 to 3.8 compared with IBM´s system. Comparisons with other systems are also given in this paper.
Keywords :
Gaussian processes; speech synthesis; Gaussian mixture model; Mandarin; formant mapping; high quality voice conversion system; phoneme-based GMM model; spectral parameter extraction; speech manipulation; speech re-synthesis; vowel phoneme; Acoustics; Artificial neural networks; Content addressable storage; Frequency conversion; Hidden Markov models; Parameter extraction; Piecewise linear techniques; Speech synthesis; Testing; Training data; GMM; formant mapping; voice conversion;
Conference_Titel :
Digital Telecommunications, 2007. ICDT '07. Second International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
0-7695-2910-0
Electronic_ISBN :
0-7695-2910-0
DOI :
10.1109/ICDT.2007.19