DocumentCode
312232
Title
Training data selection for voice conversion using speaker selection and vector field smoothing
Author
Hashimoto, Makoto ; Higuchi, Norio
Author_Institution
ATR Interpreting Telephony Res. Labs., Kyoto, Japan
Volume
3
fYear
1996
fDate
3-6 Oct 1996
Firstpage
1397
Abstract
We have previously proposed a spectral mapping method (SSVFS) for voice conversion with a small amount of training data, using speaker selection and vector field smoothing techniques. Both objective and subjective evaluations have already shown that SSVFS is effective for spectral mapping, and that it can operate with a very small amount of training data, as little as one word (Hashimoto and Higuchi, 1995). We propose a criterion for selecting effective training data for SSVFS. We define coverage of parameter space with respect to the training procedure of SSVFS as the criterion. This criterion is useful not only for selecting effective training samples, which is important for the efficient learning of spectral characteristics, but also for estimating the degree to which learning has been carried out. To evaluate the validity of the proposed criterion, we measured the correlation between spectral resemblance and coverage. The results showed that the mean correlation coefficient for eight target speakers is -0.74 with the proposed criterion, and -0.59 without consideration of the training procedure. We conclude that the proposed criterion is useful in selecting effective training samples for SSVFS.
Keywords
learning (artificial intelligence); natural language interfaces; spectral analysis; speech synthesis; vectors; SSVFS; learning; mean correlation coefficient; objective evaluations; parameter space; speaker selection; spectral characteristics; spectral coverage; spectral mapping method; spectral resemblance; speech synthesis; subjective evaluations; training data selection; vector field smoothing; voice conversion; Interpolation; Loudspeakers; Smoothing methods; Speech synthesis; Training data;
fLanguage
English
Publisher
IEEE
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.607875
Filename
607875