Title :
Voice conversion based on Non-negative Matrix Factorization in noisy environments
Author :
Fujii, Teruya ; Aihara, Ryo ; Takashima, Ryoichi ; Takiguchi, Tetsuya ; Ariki, Yasuo
Author_Institution :
Grad. Sch. of Syst. Inf., Kobe Univ., Kobe, Japan
Abstract :
This paper presents a voice conversion (VC) technique for noisy environments. We prepared parallel exemplars (dictionary) that consist of the source and target exemplars, which have the same texts uttered by the source and target speakers. The input source signal is decomposed into the source exemplars, noise exemplars obtained from the input signal, and their weights (activities). Then, the converted signal is obtained by calculating the linear combination of the target exemplars and the weights which are calculated using the source exemplars. In the proposed method, a Gaussian Mixture Model (GMM) -based conversion method is also applied to the feature vectors generated by the sparse coding in order to compensate a mismatch between the weights of source and target exemplars. The effectiveness of this method was confirmed by comparing its effectiveness with that of a conventional method.
Keywords :
Gaussian processes; dictionaries; matrix decomposition; mixture models; noise (working environment); source separation; speech enhancement; GMM based conversion method; Gaussian mixture model; VC technique; dictionary; feature vectors; linear combination; noise exemplars; noisy environments; nonnegative matrix factorization; parallel exemplars; source exemplars; source speakers; sparse coding; target exemplars; target speakers; voice conversion; Cepstrum; Dictionaries; Encoding; Feature extraction; Noise; Noise measurement; Speech;
Conference_Titel :
System Integration (SII), 2013 IEEE/SICE International Symposium on
Conference_Location :
Kobe
DOI :
10.1109/SII.2013.6776630