مرکز منطقه ای اطلاع رساني علوم و فناوري - Non-parallel training for voice conversion based on adaptation method

DocumentCode :

1687346

Title :

Non-parallel training for voice conversion based on adaptation method

Author :

Peng Song ; Wenming Zheng ; Li Zhao

Author_Institution :

Sch. of Inf. Sci. & Eng., Southeast Univ., Nanjing, China

fYear :

2013

Firstpage :

6905

Lastpage :

6909

Abstract :

In this paper, we propose a simple and efficient non-parallel training scheme for voice conversion (VC). First, the speaker models are adapted from the background model using maximum a posteriori (MAP) technique. Then, by utilizing the parameters of adapted speaker models, the Gaussian normalization and mean transformation methods are proposed for VC, respectively. In addition, to improve the conversion performance of the proposed methods, a combination approach is further presented. Finally, objective and subjective experiments are carried out to evaluate the performance of the proposed scheme, the results demonstrate that our scheme can obtain comparable performance with the traditional GMM method based on parallel corpus.

Keywords :

Gaussian processes; maximum likelihood estimation; speech synthesis; GMM method; Gaussian normalization; MAP technique; adaptation method; adapted speaker models; background model; combination approach; conversion performance; maximum a posteriori; mean transformation methods; nonparallel training scheme; parallel corpus; speaker models; voice conversion; Adaptation models; Heuristic algorithms; Hidden Markov models; Speech; Speech processing; Training; Vectors; Gaussian normalization; MAP; Voice conversion; mean transformation; non-parallel training;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on

Conference_Location :

Vancouver, BC

ISSN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2013.6639000

Filename :

6639000

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1687346