• DocumentCode
    3605833
  • Title

    A New HMM-Based Voice Conversion Methodology Evaluated on Monolingual and Cross-Lingual Conversion Tasks

  • Author

    Percybrooks, Winston S. ; Moore, Elliot

  • Author_Institution
    Dept. of Electr. & Electron. Eng., Univ. del Norte, Barranquilla, Colombia
  • Volume
    23
  • Issue
    12
  • fYear
    2015
  • Firstpage
    2298
  • Lastpage
    2310
  • Abstract
    The work presented here proposes a new voice conversion (VC) approach based on hidden Markov models (HMMs) for spectral conversion and excitation estimation. This paper is divided in two main parts: First, an initial HMM-based VC system is presented and compared to a state-of-the-art ML-GMM VC system in a monolingual conversion scenario with parallel training data; The second part shows the necessary modifications to use the HMM VC system in a cross-lingual conversion scenario and compares it with a cross-lingual VC system based on artificial neural networks (ANNs). The results of the tests show improved performance of the proposed HMM VC system compared with both the ML-GMM and ANN-based VC alternatives, while at the same time keeping most of the flexibility afforded by the ANN approach with respect to training data requirements.
  • Keywords
    computational linguistics; hidden Markov models; neural nets; spectral analysis; speech processing; ANN approach; HMM-based VC system; HMM-based voice conversion methodology; ML-GMM VC system; artificial neural networks; cross-lingual conversion task; excitation estimation; hidden Markov models; monolingual conversion task; parallel training data; performance improvement; spectral conversion; Feature extraction; Hidden Markov models; Languages; Speech processing; Voice processing; Voice conversion; cross-lingual conversion; excitation estimation; hidden Markov models (HMMs); subjective testing;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2015.2479040
  • Filename
    7268860