مرکز منطقه ای اطلاع رساني علوم و فناوري - Efficient model re-estimation in voice conversion

DocumentCode :

700105

Title :

Efficient model re-estimation in voice conversion

Author :

Jilei Tian ; Popa, Victor ; Nurminen, Jani

Author_Institution :

Media Lab., Nokia Res. Center, Nokia, Finland

fYear :

2008

fDate :

25-29 Aug. 2008

Firstpage :

Lastpage :

Abstract :

Voice conversion systems aim at converting an utterance spoken by one speaker to sound as speech uttered by a second speaker. Over the last few years, the interest towards voice conversion has risen immensely. Gaussian mixture model (GMM) based techniques have been found to be efficient in the transformation of features represented as scalars or vectors. However, reasonably large amount of aligned training data is needed to achieve good results. To solve this problem, this paper presents an efficient model re-estimation scheme. The proposed technique is based on adjusting an existing well-trained conversion model for a new target speaker with only a very small amount of training data. The experimental results provided in the paper demonstrate the efficiency of the re-estimation approach in line spectral frequency conversion and show that the proposed approach can reach good performance while using only a very limited amount of adaptation data.

Keywords :

Gaussian processes; mixture models; speaker recognition; speech processing; GMM technique; Gaussian mixture model; efficient model reestimation scheme; feature transformation; line spectral frequency conversion; speech utterance; utterance to sound conversion; voice conversion system; Abstracts;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal Processing Conference, 2008 16th European

Conference_Location :

Lausanne

ISSN :

2219-5491

Type :

conf

Filename :

7080637

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=700105