مرکز منطقه ای اطلاع رساني علوم و فناوري - Voice Conversion using structrued Gaussian Mixture Model

DocumentCode :

3423945

Title :

Voice Conversion using structrued Gaussian Mixture Model

Author :

Zeng, Daojian ; Yu, Yibiao

Author_Institution :

Sch. of Electron. & Inf. Eng., Soochow Univ., Suzhou, China

fYear :

2010

fDate :

24-28 Oct. 2010

Firstpage :

541

Lastpage :

544

Abstract :

Gaussian Mixture Model (GMM) is commonly used in voice conversion. However, traditional GMM based voice conversion usually extracts a conversion function from parallel corpus, which greatly limits the application of the technology. In an attempt to overcome this drawback, structured Gaussian Mixture Model (SGMM) is applied to model the speaker´s acoustic feature distribution. In particular, two speakers´ isolated SGMMs are aligned based on Acoustic Universal Structure (AUS) theory. Then the conversion function is extracted from two aligned SGMMs in a manner similar to conventional method. The subjective listening tests indicate that the proposed method achieves equivalent speech quality and speaker individuality compared with conventional method.

Keywords :

Gaussian processes; speaker recognition; acoustic universal structure theory; listening tests; speaker acoustic feature distribution; speaker individuality; speech quality; structured Gaussian mixture model; voice conversion; Acoustic distortion; Acoustics; Feature extraction; Hidden Markov models; Joints; Speech; Training; AUS; SGMM; voice conversion;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal Processing (ICSP), 2010 IEEE 10th International Conference on

Conference_Location :

Beijing

Print_ISBN :

978-1-4244-5897-4

Type :

conf

DOI :

10.1109/ICOSP.2010.5656960

Filename :

5656960

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3423945