مرکز منطقه ای اطلاع رساني علوم و فناوري - Mixture of Factor Analyzers Using Priors From Non-Parallel Speech for Voice Conversion

DocumentCode :

1323599

Title :

Mixture of Factor Analyzers Using Priors From Non-Parallel Speech for Voice Conversion

Author :

Wu, Zhizheng ; Kinnunen, Tomi ; Chng, Eng Siong ; Li, Haizhou

Author_Institution :

Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore

Volume :

Issue :

fYear :

2012

Firstpage :

914

Lastpage :

917

Abstract :

A robust voice conversion function relies on a large amount of parallel training data, which is difficult to collect in practice. To tackle the sparse parallel training data problem in voice conversion, this paper describes a mixture of factor analyzers method which integrates prior knowledge from non-parallel speech into the training of conversion function. The experiments on CMU ARCTIC corpus show that the proposed method improves the quality and similarity of converted speech. With both objective and subjective evaluations, we show the proposed method outperforms the baseline GMM method.

Keywords :

speech processing; CMU ARCTIC corpus; baseline GMM method; factor analyzers; nonparallel speech; objective evaluations; robust voice conversion function; sparse parallel training data problem; subjective evaluations; Covariance matrix; Expectation-maximization algorithms; Speech; Training data; Vectors; Voice conversion; factor analysis; mixture of factor analyzers; prior knowledge;

fLanguage :

English

Journal_Title :

Signal Processing Letters, IEEE

Publisher :

ieee

ISSN :

1070-9908

Type :

jour

DOI :

10.1109/LSP.2012.2225615

Filename :

6334423

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1323599