Title :
Transmutative voice conversion
Author :
Mohammadi, Seyed Hamidreza ; Kain, Alexander
Author_Institution :
Center for Spoken Language Understanding, Oregon Health & Sci. Univ., Portland, OR, USA
Abstract :
There are two types of voice conversion (VC) systems: generative and transmutative. A generative VC system typically uses a compact parametrization of speech and maps input to output parameters directly; however, the relative low dimensionality of the underlying speech model reduces quality. On the other hand, a transmutative VC system modifies high-dimensional features of a high-fidelity speech model, leaving critical details unmodified. Two versions of transmutative VC approach are implemented and compared to a generative VC approach. The results show that the implemented transmutative VC is significantly better compared to generative VC in terms of quality. The difference between the two VC methods regarding recognition scores are insignificant.
Keywords :
speaker recognition; speech processing; com- pact speech parametrization; generative VC system; high-dimensional features; high-fidelity speech model; recognition scores; transmutative voice conversion; Decision support systems; frequency warping; speech transformation; voice conversion;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639003