• DocumentCode
    730766
  • Title

    Activity-mapping non-negative matrix factorization for exemplar-based voice conversion

  • Author

    Aihara, Ryo ; Takiguchi, Tetsuya ; Ariki, Yasuo

  • Author_Institution
    Grad. Sch. of Syst. Inf., Kobe Univ., Kobe, Japan
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    4899
  • Lastpage
    4903
  • Abstract
    Voice conversion (VC) is being widely researched in the field of speech processing because of increased interest in using such processing in applications such as personalized Text-To-Speech systems. We present in this paper an exemplar-based VC method using Non-negative Matrix Factorization (NMF), which is different from conventional statistical VC. In our previous exemplar-based VC method, input speech is represented by the source dictionary and its sparse coefficients. The source and the target dictionaries are fully coupled and the converted voice is constructed from the source coefficients and the target dictionary. In this paper, we propose an Activity-mapping NMF approach and introduce mapping matrices between source and target sparse coefficients. The effectiveness of this method was confirmed by comparing its effectiveness with that of a conventional Gaussian Mixture Model (GMM)-based method and a conventional NMF-based method.
  • Keywords
    compressed sensing; matrix decomposition; signal representation; speech processing; GMM-based method; Gaussian mixture model-based method; activity-mapping NMF approach; converted voice; exemplar-based VC method; mapping matrices; nonnegative matrix factorization; personalized text-to-speech systems; source coefficients; source dictionary; sparse coefficients; speech processing; target dictionary; voice conversion; Dictionaries; Estimation; Gaussian mixture model; Matrix converters; Sparse matrices; Speech; Training data; NMF; non-negative matrix factorization; sparse representation; voice conversion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178902
  • Filename
    7178902