DocumentCode
3274007
Title
A GMM based residual prediction method for voice conversion
Author
Xia, Jing ; Yin, Junxun
Author_Institution
South China Univ. of Technol., Guangzhou, China
fYear
2005
fDate
13-16 Dec. 2005
Firstpage
389
Lastpage
392
Abstract
The purpose of a voice conversion (VC) system is to change the perceived speaker identity of a speech signal to sound as if a target speaker had spoken it. In this paper, we propose a residual prediction method for the spectral detail transformation component of a VC system. The algorithm described here is based on the LPC analysis/synthesis framework, and achieves residual prediction from LPC parameters during voiced speech. This step consists of a GMM based LPC parameter classifier and a LPC residual codebook. The predicted residual is then combined with all-pole LPC spectrum to synthesize speech signal. Several aspects of this residual prediction method, including the validation of the codebook and the performance of the coded speech are tested using objective measures. The converted speech is found nearly indistinguishable from the target speaker´s individuality in informal listening tests.
Keywords
linear predictive coding; speech coding; speech synthesis; codebook; coded speech; linear prediction coefficients; residual prediction method; speech signal; synthesize speech signal; voice conversion; Filters; Linear predictive coding; Loudspeakers; Prediction methods; Signal analysis; Signal synthesis; Speech analysis; Speech synthesis; Testing; Virtual colonoscopy;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Signal Processing and Communication Systems, 2005. ISPACS 2005. Proceedings of 2005 International Symposium on
Print_ISBN
0-7803-9266-3
Type
conf
DOI
10.1109/ISPACS.2005.1595428
Filename
1595428
Link To Document