• DocumentCode
    20259
  • Title

    Mixtures of Local Dictionaries for Unsupervised Speech Enhancement

  • Author

    Minje Kim ; Smaragdis, Paris

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
  • Volume
    22
  • Issue
    3
  • fYear
    2015
  • fDate
    Mar-15
  • Firstpage
    293
  • Lastpage
    297
  • Abstract
    We propose a novel extension of Nonnegative Matrix Factorization (NMF) that models a signal with multiple local dictionaries activated sparsely. This set of local dictionaries for a source, e.g., speech, disjointly constitute a superset that is more discriminative than an ordinary NMF dictionary, because its local structures represent the source´s manifold better. A block sparsity constraint is used to regularize the NMF solutions so that only one or a small number of blocks are active at a given time. Moreover, a concentrationz prior further regularizes each block of bases to be close to each other for better locality preservation. We test the proposed Mixture of Local Dictionaries (MLD) on single-channel speech enhancement tasks and show that it outperforms the state of the art technology by up to 2 dB in signal-to-distortion ratio, especially in the unsupervised environment where neither the speaker identity nor the type of noise is known in advance.
  • Keywords
    dictionaries; matrix decomposition; speech enhancement; unsupervised learning; MLD; NMF; mixture of local dictionaries; nonnegative matrix factorization; single-channel speech enhancement tasks; unsupervised speech enhancement; Dictionaries; Manifolds; Noise; Speech; Speech enhancement; Training; Vectors; Manifold learning; nonnegative matrix factorization; speech enhancement;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2014.2346506
  • Filename
    6874558