• DocumentCode
    178404
  • Title

    Transductive nonnegative matrix factorization for semi-supervised high-performance speech separation

  • Author

    Naiyang Guan ; Long Lan ; Dacheng Tao ; Zhigang Luo ; Xuejun Yang

  • Author_Institution
    Sci. & Technol. on Parallel & Distrib. Process. Lab., Nat. Univ. of Defense Technol., Changsha, China
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    2534
  • Lastpage
    2538
  • Abstract
    Regarding the non-negativity property of the magnitude spectrogram of speech signals, nonnegative matrix factorization (NMF) has obtained promising performance for speech separation by independently learning a dictionary on the speech signals of each known speaker. However, traditional NM-F fails to represent the mixture signals accurately because the dictionaries for speakers are learned in the absence of mixture signals. In this paper, we propose a new transductive NMF algorithm (TNMF) to jointly learn a dictionary on both speech signals of each speaker and the mixture signals to be separated. Since TNMF learns a more descriptive dictionary by encoding the mixture signals than that learned by NMF, it significantly boosts the separation performance. Experiments results on a popular TIMIT dataset show that the proposed TNMF-based methods outperform traditional NMF-based methods for separating the monophonic mixtures of speech signals of known speakers.
  • Keywords
    encoding; matrix decomposition; speaker recognition; TIMIT dataset; TNMF; descriptive dictionary; dictionary learning; encoding; known speaker; magnitude spectrogram; mixture signals; monophonic mixtures; nonnegativity property; semisupervised high-performance speech separation; speech signals; transductive nonnegative matrix factorization; Dictionaries; Silicon; Spectrogram; Speech; Speech processing; Time-domain analysis; Training; Nonnegative matrix factorization; speech separation; transductive learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854057
  • Filename
    6854057