DocumentCode :
1687346
Title :
Non-parallel training for voice conversion based on adaptation method
Author :
Peng Song ; Wenming Zheng ; Li Zhao
Author_Institution :
Sch. of Inf. Sci. & Eng., Southeast Univ., Nanjing, China
fYear :
2013
Firstpage :
6905
Lastpage :
6909
Abstract :
In this paper, we propose a simple and efficient non-parallel training scheme for voice conversion (VC). First, the speaker models are adapted from the background model using maximum a posteriori (MAP) technique. Then, by utilizing the parameters of adapted speaker models, the Gaussian normalization and mean transformation methods are proposed for VC, respectively. In addition, to improve the conversion performance of the proposed methods, a combination approach is further presented. Finally, objective and subjective experiments are carried out to evaluate the performance of the proposed scheme, the results demonstrate that our scheme can obtain comparable performance with the traditional GMM method based on parallel corpus.
Keywords :
Gaussian processes; maximum likelihood estimation; speech synthesis; GMM method; Gaussian normalization; MAP technique; adaptation method; adapted speaker models; background model; combination approach; conversion performance; maximum a posteriori; mean transformation methods; nonparallel training scheme; parallel corpus; speaker models; voice conversion; Adaptation models; Heuristic algorithms; Hidden Markov models; Speech; Speech processing; Training; Vectors; Gaussian normalization; MAP; Voice conversion; mean transformation; non-parallel training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639000
Filename :
6639000
Link To Document :
بازگشت