DocumentCode :
2093517
Title :
Research on voice conversion based codebook and GMM
Author :
Xie, Wei-chao ; Zhang, Ling-hua
Author_Institution :
Coll. of Telecommun. & Inf. Eng., Nanjing Univ. of Posts & Telecommun., Nanjing, China
fYear :
2010
fDate :
11-14 Nov. 2010
Firstpage :
1403
Lastpage :
1406
Abstract :
Voice conversion (VC) is a technique for changing the personality characteristics of a source speaker's voice into those of a target speaker while preserving the original semantic information. This paper studies a codebook-based voice conversion method that improves conversion quality. First, the personality parameters of the source and target speakers are time-aligned to create the source and target codebooks. The parameters to be converted are then compared with the source codebook. If they are close enough to a source codebook entry, the corresponding target codebook parameters are taken as the converted parameters. Otherwise, a Gaussian mixture model (GMM) is used to convert the line spectral frequency (LSF) parameters, and the residual excitation signal is obtained from the pitch frequency estimated from the converted LSF parameters. This method performs better than conversion using GMM alone.
Keywords :
Gaussian processes; speaker recognition; speech coding; Gaussian mixture models; personality characteristics; residual excitation signal; semantic information; source codebook; target codebook; voice conversion based codebook; Broadband communication; Databases; Discrete cosine transforms; Speech; Speech coding;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Communication Technology (ICCT), 2010 12th IEEE International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4244-6868-3
Type :
conf
DOI :
10.1109/ICCT.2010.5689017
Filename :
5689017