Title :
Voice conversion by combining frequency warping with unit selection
Author :
Shuang, Zhiwei ; Meng, Fanping ; Qin, Yong
Author_Institution :
China Res. Lab., IBM, Hong Kong
fDate :
March 31 2008-April 4 2008
Abstract :
In this paper, we propose a novel voice conversion method by combining frequency warping and unit selection to improve the similarity to target speaker. We use frequency warping to get the warped source spectrum, which will be used as estimated target for later unit selection of the target speaker´s spectrum. Such estimated target can preserve the natural transition of human´s speech. Then, part of the warped source spectrum is replaced by the selected target speaker´s real spectrum before reconstructing the converted speech to reduce the difference in detailed spectrum. TC- STAR 2007 voice conversion evaluation results show that the proposed method can achieve about 20% improvement in similarity score compared to only frequency warping.
Keywords :
speech processing; TC- STAR 2007; frequency warping; human speech natural transition; source spectrum; target speaker spectrum; unit selection; voice conversion method; Character generation; Degradation; Diversity reception; Frequency conversion; Frequency estimation; Motion pictures; Smoothing methods; Speech analysis; Speech synthesis; Training data; Selection; Voice Conversion; Warping;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518696