DocumentCode
2703246
Title
Cost Reduction of Training Mapping Function Based on Multistep Voice Conversion
Author
Masuda, T. ; Shozakai, M.
Author_Institution
New Bus. Dev., Asahi Kasei Corp., Kanagawa, Japan
Volume
4
fYear
2007
fDate
15-20 April 2007
Abstract
Several approaches based on a statistical method for voice conversion from one speaker to another have been developed. In a statistical spectral mapping method which is a typical one in these approaches, a mapping function which represents a correlation between different speakers is determined using spectral features. This technique has the problem that it is necessary to train the mapping function for each speaker pair. The training cost must become a serious issue in case that the number of speakers increases significantly. This paper describes a novel voice conversion method for reducing the training cost. This technique is easily implemented and can use conventional techniques directly. Experimental results demonstrate that the converted speech is almost maintaining the conventional quality despite the significant training cost reduction by the proposed method.
Keywords
feature extraction; speech synthesis; correlation; multistep voice conversion; spectral features; statistical spectral mapping method; training cost reduction; training mapping function; Cost function; Maximum likelihood estimation; Speech enhancement; Speech synthesis; Statistical analysis; Vector quantization; Voice conversion; multistep voice conversion; speech synthesis; training cost;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Type
conf
DOI
10.1109/ICASSP.2007.367007
Filename
4218195
Link To Document