Title :
Probabilistic principal component analysis applied to voice conversion
Author :
Wilde, Mark M. ; Martinez, Andrew B.
Author_Institution :
Electr. Eng. & Comput. Sci., Tulane Univ., New Orleans, LA, USA
Abstract :
In our model for voice conversion, we represent the joint probabilistic acoustic space of the source and target speakers with a mixture of probabilistic principal component analyzers (PPCAs). We present a finer resolution of options to the user of the voice conversion system than traditional Gaussian mixture model based conversion. Objective experiments demonstrate that the dimension of the PPCA directly impacts resulting objective performance but saves both time and memory complexity. Subjective tests imply that incremental removal of information does not affect the listener perceptually. Thus, the end user can select with more freedom how well the system should perform.
Keywords :
acoustic signal processing; principal component analysis; probability; speech processing; joint probabilistic acoustic space; memory complexity; probabilistic principal component analysis; source speakers; target speakers; voice conversion; Acoustic testing; Computer science; Covariance matrix; Equations; Frequency; Least squares methods; Loudspeakers; Optimization methods; Principal component analysis; Speech analysis;
Conference_Titel :
Signals, Systems and Computers, 2004. Conference Record of the Thirty-Eighth Asilomar Conference on
Print_ISBN :
0-7803-8622-1
DOI :
10.1109/ACSSC.2004.1399569