Title :
Research on voice conversion based codebook and GMM
Author :
Xie, Wei-chao ; Zhang, Ling-hua
Author_Institution :
Coll. of Telecommun. & Inf. Eng., Nanjing Univ. of Posts & Telecommun., Nanjing, China
Abstract :
Voice conversion (VC) is a technique used in order to change the personality characteristics of a source speaker´s voice into the target speaker´s, while preserving the original semantic information. This paper mainly studies a method of voice conversion with better quality by codebook. Firstly, personality parameters of both source and target speaker are time aligned to create the source and target codebook. Then compare the parameters which to be converted with the source codebook. If they are near enough, the corresponding parameters of target codebook are regarded as the converted parameters. Otherwise, we will use GMM to realize the conversion of LSF parameters and get residual excitation signal by pitch frequency estimated from converted LSF parameters. This method is better than the conversion only using GMM.
Keywords :
Gaussian processes; speaker recognition; speech coding; Gaussian mixture models; personality characteristics; residual excitation signal; semantic information; source codebook; target codebook; voice conversion based codebook; Broadband communication; Databases; Discrete cosine transforms; Speech; Speech coding;
Conference_Titel :
Communication Technology (ICCT), 2010 12th IEEE International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4244-6868-3
DOI :
10.1109/ICCT.2010.5689017