DocumentCode :
180505
Title :
Non-parallel voice conversion using joint optimization of alignment by temporal context and spectral distortion
Author :
Benisty, Hadas ; Malah, David ; Crammer, Koby
Author_Institution :
Dept. of Electr. Eng., Technion - Israel Inst. of Technol., Haifa, Israel
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
7909
Lastpage :
7913
Abstract :
Many voice conversion systems require parallel training sets of the source and target speakers. Non-parallel training is more complicated as it involves evaluation of source-target correspondence along with the conversion function itself. INCA is a recently proposed method for non-parallel training, based on iterative estimation of alignment and conversion function. The alignment is evaluated using a simple nearest-neighbor search, which often leads to phonetic miss-matched source-target pairs. We propose here a generalized approach, denoted as Temporal-Context INCA (TC-INCA), based on matching temporal context vectors. We formulate the training stage as a minimization problem of a joint cost, considering both context-based alignment and conversion function. We show that TC-INCA reduces the joint cost and prove its convergence. Experimental results indicate that TC-INCA significantly improves the alignment accuracy, compared to INCA. Moreover, subjective evaluations show that TC-INCA leads to improved quality of the synthesized output signals, when small training sets are used.
Keywords :
iterative methods; optimisation; signal processing; iterative estimation; joint optimization; minimization problem; nonparallel voice conversion; spectral distortion; temporal context; Accuracy; Context; Joints; Minimization; Speech; Training; Vectors; Gaussian Mixture Model (GMM); INCA; Non-Parallel Voice Conversion; Spectral Distance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6855140
Filename :
6855140
Link To Document :
بازگشت