DocumentCode :
2612347
Title :
An improved voice conversion method using segmental GMMs and automatic GMM selection
Author :
Gu, Hung-Yan ; Tsai, Sung-Fung
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ. of Sci. & Technol., Taipei, Taiwan
Volume :
5
fYear :
2011
fDate :
15-17 Oct. 2011
Firstpage :
2395
Lastpage :
2399
Abstract :
In this paper, the idea of segmental GMMs is proposed for voice conversion. Also, to apply this idea to on-line voice conversion, we have developed an automatic GMM selection algorithm based on dynamic programming. In addition, to map a vector of DCC (discrete cepstrum coefficients) with only one Gaussian mixture, we have designed a mixture selection algorithm. For evaluating the performance of the idea, segmental GMMs, three voice conversion system are constructed and used to conduct listening tests. The results of the listening tests show that segmental GMMs proposed here can indeed help to improve the performances in both timbre similarity and voice quality.
Keywords :
Gaussian processes; dynamic programming; speech processing; Gaussian mixture; automatic GMM selection; discrete cepstrum coefficients; dynamic programming; listening tests; online voice conversion; segmental GMM; timbre similarity; voice quality; Dynamic programming; Harmonic analysis; Heuristic algorithms; Speech; Timbre; Training; Vectors; Gaussian mixture model; discrete cepstrum; harmonic plus noise model; timbre similarity; voice conversion;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image and Signal Processing (CISP), 2011 4th International Congress on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-9304-3
Type :
conf
DOI :
10.1109/CISP.2011.6100692
Filename :
6100692
Link To Document :
بازگشت