DocumentCode :
3222313
Title :
Voice Conversion Using Canonical Correlation Analysis Based on Gaussian Mixture Model
Author :
Jian, ZhiHua ; Yang, Zhen
Author_Institution :
Nanjing Univ. of Posts & Telecommun., Nanjing
Volume :
1
fYear :
2007
fDate :
July 30 2007-Aug. 1 2007
Firstpage :
210
Lastpage :
215
Abstract :
A novel algorithm for voice conversion is proposed in this paper. The mapping function of spectral vectors of the source and target speakers is calculated by the canonical correlation analysis (CCA) estimation based on Gaussian mixture models. Since the spectral envelope feature remains a majority of second order statistical information contained in speech after linear prediction (LPC) analysis, the CCA method is more suitable for spectral conversion than MMSE because CCA explicitly considers the variance of each component of the spectral vectors during conversion procedure. Both subjective and objective evaluations are conducted. The experimental results demonstrate that the proposed scheme can achieve better performance than the previous method which uses MMSE estimation criterion.
Keywords :
Gaussian processes; correlation methods; speech processing; Gaussian mixture model; canonical correlation analysis estimation; linear prediction analysis; second order statistical information; source speaker; spectral envelope feature; spectral vector mapping function; speech; target speaker; voice conversion algorithm; Analysis of variance; Artificial neural networks; Hidden Markov models; Information analysis; Linear predictive coding; Loudspeakers; Signal processing algorithms; Speech analysis; Speech synthesis; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2007. SNPD 2007. Eighth ACIS International Conference on
Conference_Location :
Qingdao
Print_ISBN :
978-0-7695-2909-7
Type :
conf
DOI :
10.1109/SNPD.2007.133
Filename :
4287504
Link To Document :
بازگشت