Title :
A GMM based residual prediction method for voice conversion
Author :
Xia, Jing ; Yin, Junxun
Author_Institution :
South China Univ. of Technol., Guangzhou, China
Abstract :
The purpose of a voice conversion (VC) system is to change the perceived speaker identity of a speech signal to sound as if a target speaker had spoken it. In this paper, we propose a residual prediction method for the spectral detail transformation component of a VC system. The algorithm described here is based on the LPC analysis/synthesis framework, and achieves residual prediction from LPC parameters during voiced speech. This step consists of a GMM based LPC parameter classifier and a LPC residual codebook. The predicted residual is then combined with all-pole LPC spectrum to synthesize speech signal. Several aspects of this residual prediction method, including the validation of the codebook and the performance of the coded speech are tested using objective measures. The converted speech is found nearly indistinguishable from the target speaker´s individuality in informal listening tests.
Keywords :
linear predictive coding; speech coding; speech synthesis; codebook; coded speech; linear prediction coefficients; residual prediction method; speech signal; synthesize speech signal; voice conversion; Filters; Linear predictive coding; Loudspeakers; Prediction methods; Signal analysis; Signal synthesis; Speech analysis; Speech synthesis; Testing; Virtual colonoscopy;
Conference_Titel :
Intelligent Signal Processing and Communication Systems, 2005. ISPACS 2005. Proceedings of 2005 International Symposium on
Print_ISBN :
0-7803-9266-3
DOI :
10.1109/ISPACS.2005.1595428