Title :
Voice conversion algorithm using phoneme Gaussian mixture model
Author :
Sheng, Lv ; Yin Junxun ; Jiancheng, Huang
Author_Institution :
Sch. of Electron. & Inf., South China Univ. of Technol., Guangzhou, China
Abstract :
This paper presents a new voice conversion algorithm which modifies the utterance of a source speaker to sound like speech from a target speaker. Our method uses speech models based on phoneme units of speech, which finds accurate alignments between source and target speaker utterances. Using the alignments, vocal tract and glottal excitation characteristics are mapped across speakers. Objective and subjective tests suggest that convincing voice conversion is achieved while maintaining high speech quality, which is comparable to other frame-based approaches.
Keywords :
Gaussian distribution; speech processing; glottal excitation characteristics; phoneme Gaussian mixture model; phoneme units; source speaker utterances; speech models; speech quality; target speaker utterances; vocal tract; voice conversion algorithm; Books; Hidden Markov models; Interpolation; Linear regression; Loudspeakers; Mice; Organizing; Smoothing methods; Speech processing; Vector quantization;
Conference_Titel :
Intelligent Multimedia, Video and Speech Processing, 2004. Proceedings of 2004 International Symposium on
Print_ISBN :
0-7803-8687-6
DOI :
10.1109/ISIMP.2004.1433986