• DocumentCode
    3423945
  • Title

    Voice Conversion using structrued Gaussian Mixture Model

  • Author

    Zeng, Daojian ; Yu, Yibiao

  • Author_Institution
    Sch. of Electron. & Inf. Eng., Soochow Univ., Suzhou, China
  • fYear
    2010
  • fDate
    24-28 Oct. 2010
  • Firstpage
    541
  • Lastpage
    544
  • Abstract
    Gaussian Mixture Model (GMM) is commonly used in voice conversion. However, traditional GMM based voice conversion usually extracts a conversion function from parallel corpus, which greatly limits the application of the technology. In an attempt to overcome this drawback, structured Gaussian Mixture Model (SGMM) is applied to model the speaker´s acoustic feature distribution. In particular, two speakers´ isolated SGMMs are aligned based on Acoustic Universal Structure (AUS) theory. Then the conversion function is extracted from two aligned SGMMs in a manner similar to conventional method. The subjective listening tests indicate that the proposed method achieves equivalent speech quality and speaker individuality compared with conventional method.
  • Keywords
    Gaussian processes; speaker recognition; acoustic universal structure theory; listening tests; speaker acoustic feature distribution; speaker individuality; speech quality; structured Gaussian mixture model; voice conversion; Acoustic distortion; Acoustics; Feature extraction; Hidden Markov models; Joints; Speech; Training; AUS; SGMM; voice conversion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing (ICSP), 2010 IEEE 10th International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-5897-4
  • Type

    conf

  • DOI
    10.1109/ICOSP.2010.5656960
  • Filename
    5656960