• DocumentCode
    231576
  • Title

    Voice conversion based on matrix variate Gaussian mixture model

  • Author

    Saito, Daisuke ; Doi, Hidenobu ; Minematsu, Nobuaki ; Hirose, Keikichi

  • Author_Institution
    Univ. of Tokyo, Tokyo, Japan
  • fYear
    2014
  • fDate
    19-23 Oct. 2014
  • Firstpage
    567
  • Lastpage
    571
  • Abstract
    This paper describes a novel approach to construct a mapping function between a given speaker pair using probability density functions (PDF) of matrix variate. In voice conversion studies, two important functions should be realized: 1) precise modeling of both the source and target feature spaces, and 2) construction of a proper transform function between these spaces. Voice conversion based on Gaussian mixture model (GMM) is widely used because of their flexibility and easiness in handling. In GMM-based approaches, a joint vector space of the source and target is first constructed, and the joint PDF of the two vectors is modeled as GMM in the joint vector space. The joint vector approach mainly focuses on precise modeling of the `joint´ feature space, and does not always construct a proper transform between two feature spaces. In contrast, the proposed method constructs the joint PDF as GMM in a matrix variate space whose row and column respectively correspond to the two functions, and it has potential to precisely model both the characteristics of the feature spaces and the relation between the source and target spaces. Experimental results show that the proposed method contributes to improve the performance of voice conversion.
  • Keywords
    Gaussian processes; matrix algebra; speaker recognition; transforms; GMM; PDF; joint vector space; mapping function; matrix variate Gaussian mixture model; matrix variate space; probability density functions; proper transform function; source feature spaces; target feature spaces; voice conversion; Covariance matrices; Equations; Joints; Mathematical model; Speech; Training; Vectors; Gaussian mixture model; Voice conversion; matrix variate Gaussian mixture model; matrix variate distribution; matrix variate normal;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing (ICSP), 2014 12th International Conference on
  • Conference_Location
    Hangzhou
  • ISSN
    2164-5221
  • Print_ISBN
    978-1-4799-2188-1
  • Type

    conf

  • DOI
    10.1109/ICOSP.2014.7015068
  • Filename
    7015068