Title :
Text-independent voice conversion based on state mapped codebook
Author :
Zhang, Meng ; Tao, Jianhua ; Tian, Jilei ; Wang, Xia
Author_Institution :
Nat. Lab. of Pattern Recognition, Chinese Acad. of Sci., Beijing
fDate :
March 31 2008-April 4 2008
Abstract :
Voice conversion has become more and more important in speech technology, but most of current works have to use parallel utterances of both source and target speaker as the training corpus, which limits the application of the technology. In the paper, we propose a new method of text- independent voice conversion which uses non-parallel corpus for the training. The hidden Markov model (HMM) is used to represent the phonetic structure of training speech and to generate the training pairs of source and target speakers by mapping the HMM states between source and target speeches. Then, HMM state mapped codebooks are generated to create the mapping function for the text- independent voice conversion. The subjective experiments based on ABX tests and MOS tests show that the method proposed in the paper gets the similar conversion performance and better speech quality compared to the conventional voice conversion systems.
Keywords :
hidden Markov models; speech processing; speech synthesis; HMM state mapped codebooks; hidden Markov model; nonparallel corpus; phonetic structure; speech technology; text-independent voice conversion; training pair generation; Automation; Character generation; Databases; Gaussian distribution; Hidden Markov models; Laboratories; Pattern recognition; Speech recognition; System testing; Training data; hidden Markov model; state mapped codebook; text-independent; voice conversion;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518682