مرکز منطقه ای اطلاع رساني علوم و فناوري - An improved voice conversion method using segmental GMMs and automatic GMM selection

DocumentCode :

2612347

Title :

An improved voice conversion method using segmental GMMs and automatic GMM selection

Author :

Gu, Hung-Yan ; Tsai, Sung-Fung

Author_Institution :

Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ. of Sci. & Technol., Taipei, Taiwan

Volume :

fYear :

2011

fDate :

15-17 Oct. 2011

Firstpage :

2395

Lastpage :

2399

Abstract :

In this paper, the idea of segmental GMMs is proposed for voice conversion. Also, to apply this idea to on-line voice conversion, we have developed an automatic GMM selection algorithm based on dynamic programming. In addition, to map a vector of DCC (discrete cepstrum coefficients) with only one Gaussian mixture, we have designed a mixture selection algorithm. For evaluating the performance of the idea, segmental GMMs, three voice conversion system are constructed and used to conduct listening tests. The results of the listening tests show that segmental GMMs proposed here can indeed help to improve the performances in both timbre similarity and voice quality.

Keywords :

Gaussian processes; dynamic programming; speech processing; Gaussian mixture; automatic GMM selection; discrete cepstrum coefficients; dynamic programming; listening tests; online voice conversion; segmental GMM; timbre similarity; voice quality; Dynamic programming; Harmonic analysis; Heuristic algorithms; Speech; Timbre; Training; Vectors; Gaussian mixture model; discrete cepstrum; harmonic plus noise model; timbre similarity; voice conversion;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Image and Signal Processing (CISP), 2011 4th International Congress on

Conference_Location :

Shanghai

Print_ISBN :

978-1-4244-9304-3

Type :

conf

DOI :

10.1109/CISP.2011.6100692

Filename :

6100692

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2612347