• DocumentCode
    3317479
  • Title

    A dynamic Gaussian process for voice conversion

  • Author

    Dong-Yan Huang ; Minghui Dong ; Haizhou Li

  • Author_Institution
    Human Language Technol. Dept., A*STAR, Singapore, Singapore
  • fYear
    2013
  • fDate
    15-19 July 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In this paper, we explore Dynamic Gaussian Processes (DGP) based learning techniques for voice conversion. In particular, we propose to use dynamic squared exponential GP with sparse partial least squares (SPLS) technique to model nonlinearities as well as to capture the dynamics in the source data. The concatenation of previous and next frames can well model dynamics. Sparse partial least squares regression is used to find a mapping function in order to overcome the problem of overfitting. The proposed dynamic GP-based learning technique features simple, efficient and high accuracy without massive tuning. The experimental results show that the proposed approach for voice conversion is able to produce good similarity between the original and the converted target voices and achieves a great improvement in the sound quality compared to the state-of-the-art Gaussian mixture-based model.
  • Keywords
    Gaussian processes; learning (artificial intelligence); least squares approximations; regression analysis; speech processing; DGP based learning techniques; SPLS technique; concatenation; converted target voices; dynamic GP-based learning technique; dynamic Gaussian process; dynamic squared exponential GP; mapping function; massive tuning; model dynamics; nonlinearity; sound quality; source data; sparse partial least squares regression; sparse partial least squares technique; state-of-the-art Gaussian mixture-based model; voice conversion; Covariance matrices; Gaussian processes; Kernel; Speech; Speech processing; Training data; Vectors; Gaussian processes; covariance functions; mapping function; sparse partial least squares regression; voice conversion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo Workshops (ICMEW), 2013 IEEE International Conference on
  • Conference_Location
    San Jose, CA
  • Type

    conf

  • DOI
    10.1109/ICMEW.2013.6618271
  • Filename
    6618271