• DocumentCode
    57911
  • Title

    NMF-based Target Source Separation Using Deep Neural Network

  • Author

    Tae Gyoon Kang ; Kisoo Kwon ; Jong Won Shin ; Nam Soo Kim

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Seoul Nat. Univ., Seoul, South Korea
  • Volume
    22
  • Issue
    2
  • fYear
    2015
  • fDate
    Feb. 2015
  • Firstpage
    229
  • Lastpage
    233
  • Abstract
    Non-negative matrix factorization (NMF) is one of the most well-known techniques that are applied to separate a desired source from mixture data. In the NMF framework, a collection of data is factorized into a basis matrix and an encoding matrix. The basis matrix for mixture data is usually constructed by augmenting the basis matrices for independent sources. However, target source separation with the concatenated basis matrix turns out to be problematic if there exists some overlap between the subspaces that the bases for the individual sources span. In this letter, we propose a novel approach to improve encoding vector estimation for target signal extraction. Estimating encoding vectors from the mixture data is viewed as a regression problem and a deep neural network (DNN) is used to learn the mapping between the mixture data and the corresponding encoding vectors. To demonstrate the performance of the proposed algorithm, experiments were conducted in the speech enhancement task. The experimental results show that the proposed algorithm outperforms the conventional encoding vector estimation scheme.
  • Keywords
    encoding; estimation theory; matrix decomposition; mixture models; neural nets; source separation; speech enhancement; vectors; DNN; NMF-based target source separation; basis matrix; concatenated basis matrix; data factorization; deep neural network; encoding matrix; encoding vector estimation; encoding vectors estimation; mixture data; nonnegative matrix factorization; speech enhancement task; target signal extraction; target source separation; Encoding; Noise; Signal processing algorithms; Source separation; Speech; Speech enhancement; Vectors; Deep neural network; dictionary learning; non-negative matrix factorization; speech enhancement; target source separation;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2014.2354456
  • Filename
    6892992