Title :
Local linear transformation for voice conversion
Author :
Popa, Victor ; Silen, Hanna ; Nurminen, Jani ; Gabbouj, Moncef
Author_Institution :
Dept. of Signal Process., Tampere Univ. of Technol., Tampere, Finland
Abstract :
Many popular approaches to spectral conversion involve linear transformations determined for particular acoustic classes and compute the converted result as a linear combination between different local transformations in an attempt to ensure a continuous conversion. These methods often produce over-smoothed spectra and parameter tracks. The proposed method computes an individual linear transformation for every feature vector based on a small neighborhood in the acoustic space thus preserving local details. The method effectively reduces the over-smoothing by eliminating undesired contributions from acoustically remote regions. The method is evaluated in listening tests against the well-known Gaussian Mixture Model based conversion, representative of the class of methods involving linear transformations. Perceptual results indicate a clear preference for the proposed scheme.
Keywords :
Gaussian processes; speech processing; Gaussian mixture model based conversion; acoustic classes; acoustic space; feature vector; local linear transformation; spectral conversion; voice conversion; Decision support systems; Euclidean distance; Frequency conversion; Indexes; Speech; Time frequency analysis; Gaussian Mixture Model (GMM); Line Spectral Frequencies (LSF); Local Linear Transformation (LLT);
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288922