• DocumentCode
    535160
  • Title

    Voice conversion method based on State-Space Model

  • Author

    Sun, Jian ; Wang, Jinming ; Zhang, Xiongwei ; Peng, Danwen

  • Author_Institution
    Inst. of Commun. Eng., PLA Univ. of Sci. & Tech. Nanjing, Nanjing, China
  • Volume
    8
  • fYear
    2010
  • fDate
    16-18 Oct. 2010
  • Firstpage
    3542
  • Lastpage
    3546
  • Abstract
    This paper presents a novel method based on State Space Model (SSM) for voice conversion. The SSM has the ability to model the spectral parameter trajectory of speech which is desirable in voice conversion. Traditional training method for SSM is EM (Expectation Maximum) algorithm. Due to its low convergence speed, the training time is very long. Moreover increasing the spectral parameter order which helps to improve the speech quality will also cause SSM training difficulties. These problems are the underlying motivations of our work. In this paper we develop an efficient and robust training procedure based on LSF (Line Spectral Frequencies) segmentation method and subspace training algorithm. The objective and subjective experiments demonstrate the superior performance of the training approach as compared with EM method.
  • Keywords
    speech processing; state-space methods; vector quantisation; voice communication; expectation maximum algorithm; line spectral frequency segmentation; spectral parameter trajectory; speech; state space model; subspace training algorithm; voice conversion method; Complexity theory; Computational modeling; Equations; Mathematical model; Parameter estimation; Speech; Training; EM algorithm; LSF; SSM(State-Space Model); segmentation; subspace method;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image and Signal Processing (CISP), 2010 3rd International Congress on
  • Conference_Location
    Yantai
  • Print_ISBN
    978-1-4244-6513-2
  • Type

    conf

  • DOI
    10.1109/CISP.2010.5647132
  • Filename
    5647132