DocumentCode
1653239
Title
Voice conversion based on improved GMM and spectrum with synchronous prosody
Author
Bing, Zhang ; Yibiao, Yu
Author_Institution
Sch. of Electron. & Inf. Eng., Soochow Univ., Suzhou
fYear
2008
Firstpage
659
Lastpage
662
Abstract
A new voice conversion approach is proposed, based on an improved GMM speaker model and a short-time spectrum with synchronous prosody. The improved GMM speaker model, which is trained on feature vectors of the source and target speakers, can overcome the over-smoothing phenomenon. The short-time spectrum with prosody is composed of LSF parameters and a pitch parameter. It describes the speaker's vocal tract and excitation characteristics more accurately than conventional methods, in which the pitch is usually set to a constant. Experimental results show that this method effectively describes the individual characteristics of the source and target speakers and the transformation relationship between them. In addition, the transformed speech has good quality, and the speaker's individuality is converted well.
Keywords
Gaussian processes; speaker recognition; spectral analysis; speech processing; GMM speaker model; Gaussian mixture model; feature vector; linear spectrum frequency; pitch parameter; speaker vocal tract; speech quality; synchronous prosody; voice conversion; Artificial neural networks; Feature extraction; Frequency; Hidden Markov models; Linear predictive coding; Linear regression; Loudspeakers; Speech analysis; Transfer functions; Vector quantization; Improved GMM; LSF; Spectrum with prosody; Voice conversion;
fLanguage
English
Publisher
IEEE
Conference_Titel
Signal Processing, 2008. ICSP 2008. 9th International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-2178-7
Electronic_ISBN
978-1-4244-2179-4
Type
conf
DOI
10.1109/ICOSP.2008.4697217
Filename
4697217