• DocumentCode
    2703246
  • Title

    Cost Reduction of Training Mapping Function Based on Multistep Voice Conversion

  • Author

    Masuda, T. ; Shozakai, M.

  • Author_Institution
    New Bus. Dev., Asahi Kasei Corp., Kanagawa, Japan
  • Volume
    4
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    Several approaches based on a statistical method for voice conversion from one speaker to another have been developed. In a statistical spectral mapping method which is a typical one in these approaches, a mapping function which represents a correlation between different speakers is determined using spectral features. This technique has the problem that it is necessary to train the mapping function for each speaker pair. The training cost must become a serious issue in case that the number of speakers increases significantly. This paper describes a novel voice conversion method for reducing the training cost. This technique is easily implemented and can use conventional techniques directly. Experimental results demonstrate that the converted speech is almost maintaining the conventional quality despite the significant training cost reduction by the proposed method.
  • Keywords
    feature extraction; speech synthesis; correlation; multistep voice conversion; spectral features; statistical spectral mapping method; training cost reduction; training mapping function; Cost function; Maximum likelihood estimation; Speech enhancement; Speech synthesis; Statistical analysis; Vector quantization; Voice conversion; multistep voice conversion; speech synthesis; training cost;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.367007
  • Filename
    4218195