Title :
Phoneme cluster based state mapping for text-independent voice conversion
Author :
Zhang, Meng ; Tao, Jiaohua ; Nurminen, Jani ; Tian, Jilei ; Wang, Xia
Author_Institution :
Nat. Lab. of Pattern Recognition, Chinese Acad. of Sci., Beijing
Abstract :
This paper takes phonetic information into account for data alignment in text-independent voice conversion. Hidden Markov models are used for representing the phonetic structure of training speech. States belonging to same phoneme are grouped together to form a phoneme cluster. A state mapped codebook based transformation is established using information on the corresponding phoneme clusters from source and targets speech and weighted linear transform. For each source vector, several nearest clusters are considered simultaneously while mapping in order to generate a continuous and stable transform. Experimental results indicate that the proposed use of phonetic information increases the similarity between converted speech and target speech. The proposed technique is applicable to both intra-lingual and cross-lingual voice conversion.
Keywords :
hidden Markov models; linguistics; speech processing; transforms; cross-lingual voice conversion; data alignment; hidden Markov model; intralingual voice conversion; phoneme cluster; phonetic information; state mapped codebook based transformation; text-independent voice conversion; training speech; weighted linear transform; Automation; Databases; Hidden Markov models; Laboratories; Pattern recognition; Research and development; Simultaneous localization and mapping; Speech recognition; Training data; Vectors; Hidden Markov Model; state mapping; text-independent voice conversion;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960575