• DocumentCode
    1653415
  • Title

    A precise estimation of vocal tract parameters for high quality voice morphing

  • Author

    Xu, Ning ; Yang, Zhen

  • Author_Institution
    Inst. of Signal Process. & Transm., Nanjing Univ. of Post & Telecommun., Nanjing
  • fYear
    2008
  • Firstpage
    684
  • Lastpage
    687
  • Abstract
    One of the most recent models for voice conversion is the classical LPC analysis-synthesis model combined with GMM, which aims to separate the excitation and vocal tract information and to learn the transformation rules with statistical methods. However, it does not work as well as intended, owing to the inaccuracy of the extracted feature information and to the over-smoothed spectra produced by traditional GMM conversion. In this paper, we propose a novel method to solve this problem, based on separating the glottal waveforms and predicting the excitations. The final results show that the transformed vocal tract parameters match those of the target better and that the high quality of the synthesized speech is preserved.
  • Keywords
    Gaussian processes; Markov processes; speech synthesis; GMM; classical LPC analysis-synthesis model; glottal waveforms; high quality voice morphing; statistical methods; vocal tract parameters; voice conversion; Data mining; Feature extraction; Information analysis; Linear predictive coding; Signal analysis; Signal processing; Signal synthesis; Speech analysis; Statistical analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing, 2008. ICSP 2008. 9th International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-2178-7
  • Electronic_ISBN
    978-1-4244-2179-4
  • Type

    conf

  • DOI
    10.1109/ICOSP.2008.4697223
  • Filename
    4697223