Title :
Voice conversion based on matrix variate Gaussian mixture model
Author :
Saito, Daisuke ; Doi, Hidenobu ; Minematsu, Nobuaki ; Hirose, Keikichi
Author_Institution :
Univ. of Tokyo, Tokyo, Japan
Abstract :
This paper describes a novel approach to construct a mapping function between a given speaker pair using probability density functions (PDF) of matrix variate. In voice conversion studies, two important functions should be realized: 1) precise modeling of both the source and target feature spaces, and 2) construction of a proper transform function between these spaces. Voice conversion based on Gaussian mixture model (GMM) is widely used because of their flexibility and easiness in handling. In GMM-based approaches, a joint vector space of the source and target is first constructed, and the joint PDF of the two vectors is modeled as GMM in the joint vector space. The joint vector approach mainly focuses on precise modeling of the `joint´ feature space, and does not always construct a proper transform between two feature spaces. In contrast, the proposed method constructs the joint PDF as GMM in a matrix variate space whose row and column respectively correspond to the two functions, and it has potential to precisely model both the characteristics of the feature spaces and the relation between the source and target spaces. Experimental results show that the proposed method contributes to improve the performance of voice conversion.
Keywords :
Gaussian processes; matrix algebra; speaker recognition; transforms; GMM; PDF; joint vector space; mapping function; matrix variate Gaussian mixture model; matrix variate space; probability density functions; proper transform function; source feature spaces; target feature spaces; voice conversion; Covariance matrices; Equations; Joints; Mathematical model; Speech; Training; Vectors; Gaussian mixture model; Voice conversion; matrix variate Gaussian mixture model; matrix variate distribution; matrix variate normal;
Conference_Titel :
Signal Processing (ICSP), 2014 12th International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4799-2188-1
DOI :
10.1109/ICOSP.2014.7015068