Regional Information Center for Science and Technology - Voice conversion algorithm using phoneme Gaussian mixture model

DocumentCode :

3245413

Title :

Voice conversion algorithm using phoneme Gaussian mixture model

Author :

Sheng, Lv ; Yin Junxun ; Jiancheng, Huang

Author_Institution :

Sch. of Electron. & Inf., South China Univ. of Technol., Guangzhou, China

fYear :

2004

fDate :

20-22 Oct. 2004

Firstpage :

Lastpage :

Abstract :

This paper presents a new voice conversion algorithm that modifies a source speaker's utterance to sound like speech from a target speaker. The method uses speech models based on phoneme units to find accurate alignments between source and target speaker utterances. Using these alignments, vocal tract and glottal excitation characteristics are mapped across speakers. Objective and subjective tests suggest that convincing voice conversion is achieved while maintaining high speech quality, comparable to that of other frame-based approaches.

Keywords :

Gaussian distribution; speech processing; glottal excitation characteristics; phoneme Gaussian mixture model; phoneme units; source speaker utterances; speech models; speech quality; target speaker utterances; vocal tract; voice conversion algorithm; Books; Hidden Markov models; Interpolation; Linear regression; Loudspeakers; Mice; Organizing; Smoothing methods; Speech processing; Vector quantization

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Intelligent Multimedia, Video and Speech Processing, 2004. Proceedings of 2004 International Symposium on

Print_ISBN :

0-7803-8687-6

Type :

conf

DOI :

10.1109/ISIMP.2004.1433986

Filename :

1433986

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3245413