Title :
Voice conversion based on improved GMM and spectrum with synchronous prosody
Author :
Bing, Zhang ; Yibiao, Yu
Author_Institution :
Sch. of Electron. & Inf. Eng., Soochow Univ., Suzhou
Abstract :
A new voice conversion approach is proposed, based on an improved GMM speaker model and a short-time spectrum with synchronous prosody. The improved GMM speaker model, which is trained on feature vectors of the source and target speakers, can overcome the over-smoothing phenomenon. The short-time spectrum with prosody is composed of LSF parameters and a pitch parameter. Compared with conventional methods, in which the pitch is usually set as a constant, it describes the speaker's vocal tract and excitation characteristics more accurately. Experimental results show that the method effectively describes the individual characteristics of the source and target speakers and the transformation relationship between them. In addition, the transformed speech has good quality, and the speaker's individuality is transformed well.
Keywords :
Gaussian processes; speaker recognition; spectral analysis; speech processing; GMM speaker model; Gaussian mixture model; feature vector; linear spectrum frequency; pitch parameter; speaker vocal tract; speech quality; synchronous prosody; voice conversion; Artificial neural networks; Feature extraction; Frequency; Hidden Markov models; Linear predictive coding; Linear regression; Loudspeakers; Speech analysis; Transfer functions; Vector quantization; Improved GMM; LSF; Spectrum with prosody; Voice conversion;
Conference_Title :
Signal Processing, 2008. ICSP 2008. 9th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-2178-7
Electronic_ISBN :
978-1-4244-2179-4
DOI :
10.1109/ICOSP.2008.4697217