• DocumentCode
    3274007
  • Title
    A GMM based residual prediction method for voice conversion
  • Author
    Xia, Jing ; Yin, Junxun
  • Author_Institution
    South China Univ. of Technol., Guangzhou, China
  • fYear
    2005
  • fDate
    13-16 Dec. 2005
  • Firstpage
    389
  • Lastpage
    392
  • Abstract
    The purpose of a voice conversion (VC) system is to change the perceived speaker identity of a speech signal so that it sounds as if a target speaker had spoken it. In this paper, we propose a residual prediction method for the spectral detail transformation component of a VC system. The algorithm described here is based on the LPC analysis/synthesis framework and predicts the residual from LPC parameters during voiced speech. This step consists of a GMM-based LPC parameter classifier and an LPC residual codebook. The predicted residual is then combined with the all-pole LPC spectrum to synthesize the speech signal. Several aspects of this residual prediction method, including the validity of the codebook and the performance of the coded speech, are tested using objective measures. In informal listening tests, the individuality of the converted speech is found to be nearly indistinguishable from that of the target speaker.
  • Keywords
    linear predictive coding; speech coding; speech synthesis; codebook; coded speech; linear prediction coefficients; residual prediction method; speech signal; synthesize speech signal; voice conversion; Filters; Linear predictive coding; Loudspeakers; Prediction methods; Signal analysis; Signal synthesis; Speech analysis; Speech synthesis; Testing; Virtual colonoscopy;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Intelligent Signal Processing and Communication Systems, 2005. ISPACS 2005. Proceedings of 2005 International Symposium on
  • Print_ISBN
    0-7803-9266-3
  • Type
    conf
  • DOI
    10.1109/ISPACS.2005.1595428
  • Filename
    1595428
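
The pipeline outlined in the abstract (classify LPC parameters with a GMM, look up a residual in a codebook, then drive an all-pole filter with it) might be sketched roughly as follows. This is only an illustration, not the authors' implementation: the function names, the diagonal-covariance GMM, the hard (maximum-posterior) class decision, and all toy numbers are assumptions, and the record does not say how the paper trains the GMM or builds the codebook.

```python
import math

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at x."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )

def classify(x, weights, means, variances):
    """Index of the GMM component with the highest posterior for x."""
    scores = [
        math.log(w) + log_gauss_diag(x, m, v)
        for w, m, v in zip(weights, means, variances)
    ]
    return max(range(len(scores)), key=scores.__getitem__)

def predict_residual(x, gmm, codebook):
    """Return the codebook residual of the most likely component."""
    return codebook[classify(x, *gmm)]

def all_pole_synthesis(residual, lpc):
    """Filter the residual through 1/A(z), A(z) = 1 + sum_k a_k z^-k."""
    out = []
    for n, e in enumerate(residual):
        feedback = sum(
            a * out[n - k] for k, a in enumerate(lpc, start=1) if n - k >= 0
        )
        out.append(e - feedback)
    return out

# Hypothetical toy model: two components over a 2-D feature vector.
gmm = (
    [0.5, 0.5],                  # mixture weights
    [[0.0, 0.0], [1.0, 1.0]],    # component means
    [[0.1, 0.1], [0.1, 0.1]],    # diagonal variances
)
# Hypothetical codebook: one short residual segment per component.
codebook = [[1.0, 0.0, 0.0, 0.0], [0.5, 0.5, 0.0, 0.0]]

features = [0.9, 1.1]            # stand-in for a frame's LPC-derived features
lpc = [-0.5]                     # stand-in first-order predictor coefficient
residual = predict_residual(features, gmm, codebook)
frame = all_pole_synthesis(residual, lpc)
```

In this sketch the feature vector used for classification and the filter coefficients are separate toy values; in an actual system both would come from the LPC analysis of the same frame.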