• DocumentCode
    3351
  • Title

    Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization

  • Author

    Mohammadiha, Nasser ; Smaragdis, Paris ; Leijon, Arne

  • Author_Institution
    Dept. of Electr. Eng., KTH R. Inst. of Technol., Stockholm, Sweden
  • Volume
    21
  • Issue
    10
  • fYear
    2013
  • fDate
    Oct. 2013
  • Firstpage
    2140
  • Lastpage
    2151
  • Abstract
    Reducing the interference noise in a monaural noisy speech signal has been a challenging task for many years. Compared to traditional unsupervised speech enhancement methods, e.g., Wiener filtering, supervised approaches, such as algorithms based on hidden Markov models (HMM), lead to higher-quality enhanced speech signals. However, the main practical difficulty of these approaches is that for each noise type a model is required to be trained a priori. In this paper, we investigate a new class of supervised speech denoising algorithms using nonnegative matrix factorization (NMF). We propose a novel speech enhancement method that is based on a Bayesian formulation of NMF (BNMF). To circumvent the mismatch problem between the training and testing stages, we propose two solutions. First, we use an HMM in combination with BNMF (BNMF-HMM) to derive a minimum mean square error (MMSE) estimator for the speech signal with no information about the underlying noise type. Second, we suggest a scheme to learn the required noise BNMF model online, which is then used to develop an unsupervised speech enhancement system. Extensive experiments are carried out to investigate the performance of the proposed methods under different conditions. Moreover, we compare the performance of the developed algorithms with state-of-the-art speech enhancement schemes using various objective measures. Our simulations show that the proposed BNMF-based methods outperform the competing algorithms substantially.
  • Keywords
    Wiener filters; hidden Markov models; matrix decomposition; signal denoising; speech enhancement; BNMF model online; BNMF-based methods; Bayesian formulation; MMSE estimator; Wiener filtering; hidden Markov models; interference noise; monaural noisy speech signal; nonnegative matrix factorization; speech enhancement schemes; supervised approaches; supervised speech denoising algorithms; supervised speech enhancement; unsupervised speech enhancement; unsupervised speech enhancement methods; Bayesian inference; HMM; PLCA; nonnegative matrix factorization (NMF); speech enhancement;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2270369
  • Filename
    6544586