Title :
A New HMM-Based Voice Conversion Methodology Evaluated on Monolingual and Cross-Lingual Conversion Tasks
Author :
Percybrooks, Winston S. ; Moore, Elliot
Author_Institution :
Dept. of Electr. & Electron. Eng., Univ. del Norte, Barranquilla, Colombia
Abstract :
The work presented here proposes a new voice conversion (VC) approach based on hidden Markov models (HMMs) for spectral conversion and excitation estimation. This paper is divided in two main parts: First, an initial HMM-based VC system is presented and compared to a state-of-the-art ML-GMM VC system in a monolingual conversion scenario with parallel training data; The second part shows the necessary modifications to use the HMM VC system in a cross-lingual conversion scenario and compares it with a cross-lingual VC system based on artificial neural networks (ANNs). The results of the tests show improved performance of the proposed HMM VC system compared with both the ML-GMM and ANN-based VC alternatives, while at the same time keeping most of the flexibility afforded by the ANN approach with respect to training data requirements.
Keywords :
computational linguistics; hidden Markov models; neural nets; spectral analysis; speech processing; ANN approach; HMM-based VC system; HMM-based voice conversion methodology; ML-GMM VC system; artificial neural networks; cross-lingual conversion task; excitation estimation; hidden Markov models; monolingual conversion task; parallel training data; performance improvement; spectral conversion; Feature extraction; Hidden Markov models; Languages; Speech processing; Voice processing; Voice conversion; cross-lingual conversion; excitation estimation; hidden Markov models (HMMs); subjective testing;
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
DOI :
10.1109/TASLP.2015.2479040