DocumentCode
535160
Title
Voice conversion method based on State-Space Model
Author
Sun, Jian ; Wang, Jinming ; Zhang, Xiongwei ; Peng, Danwen
Author_Institution
Inst. of Commun. Eng., PLA Univ. of Sci. & Tech. Nanjing, Nanjing, China
Volume
8
fYear
2010
fDate
16-18 Oct. 2010
Firstpage
3542
Lastpage
3546
Abstract
This paper presents a novel method based on State Space Model (SSM) for voice conversion. The SSM has the ability to model the spectral parameter trajectory of speech which is desirable in voice conversion. Traditional training method for SSM is EM (Expectation Maximum) algorithm. Due to its low convergence speed, the training time is very long. Moreover increasing the spectral parameter order which helps to improve the speech quality will also cause SSM training difficulties. These problems are the underlying motivations of our work. In this paper we develop an efficient and robust training procedure based on LSF (Line Spectral Frequencies) segmentation method and subspace training algorithm. The objective and subjective experiments demonstrate the superior performance of the training approach as compared with EM method.
Keywords
speech processing; state-space methods; vector quantisation; voice communication; expectation maximum algorithm; line spectral frequency segmentation; spectral parameter trajectory; speech; state space model; subspace training algorithm; voice conversion method; Complexity theory; Computational modeling; Equations; Mathematical model; Parameter estimation; Speech; Training; EM algorithm; LSF; SSM(State-Space Model); segmentation; subspace method;
fLanguage
English
Publisher
ieee
Conference_Titel
Image and Signal Processing (CISP), 2010 3rd International Congress on
Conference_Location
Yantai
Print_ISBN
978-1-4244-6513-2
Type
conf
DOI
10.1109/CISP.2010.5647132
Filename
5647132
Link To Document