Title :
Efficient model re-estimation in voice conversion
Author :
Jilei Tian ; Popa, Victor ; Nurminen, Jani
Author_Institution :
Media Lab., Nokia Res. Center, Nokia, Finland
Abstract :
Voice conversion systems aim at converting an utterance spoken by one speaker to sound as speech uttered by a second speaker. Over the last few years, the interest towards voice conversion has risen immensely. Gaussian mixture model (GMM) based techniques have been found to be efficient in the transformation of features represented as scalars or vectors. However, reasonably large amount of aligned training data is needed to achieve good results. To solve this problem, this paper presents an efficient model re-estimation scheme. The proposed technique is based on adjusting an existing well-trained conversion model for a new target speaker with only a very small amount of training data. The experimental results provided in the paper demonstrate the efficiency of the re-estimation approach in line spectral frequency conversion and show that the proposed approach can reach good performance while using only a very limited amount of adaptation data.
Keywords :
Gaussian processes; mixture models; speaker recognition; speech processing; GMM technique; Gaussian mixture model; efficient model reestimation scheme; feature transformation; line spectral frequency conversion; speech utterance; utterance to sound conversion; voice conversion system; Abstracts;
Conference_Titel :
Signal Processing Conference, 2008 16th European
Conference_Location :
Lausanne