• DocumentCode
    2069784
  • Title

    Model-based non-negative matrix factorization for single-channel speech separation

  • Author

    Zheng, Nengheng ; Lee, Tan ; Mak, Chun-Man

  • Author_Institution
    Coll. of Inf. Engineergin, Shenzhen Univ., Shenzhen, China
  • fYear
    2011
  • fDate
    14-16 Sept. 2011
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    A model-based non-negative matrix factorization (NMF) algorithm is formulated for single-channel speech source separation. With linguistic priors of the speech sources, the state-aligned spectral envelopes of these sources are inferred from acoustic models. Being initialized with the computed spectral envelopes, NMF processing is implemented to estimate the spectral envelope trajectory of each source. Subsequently the source speech is generated by reshaping the mixture spectra according to the estimated spectral envelopes, followed by the reduction of interfering harmonic components. An iterative process is developed for more reliable time alignment and hence better performance of separation. Experimental results show that two speech sources with equal intensity can be successfully separated by the proposed algorithm.
  • Keywords
    matrix decomposition; source separation; speech processing; acoustic models; model based nonnegative matrix factorization; single channel speech separation; spectral envelope estimation; spectral envelope trajectory; Acoustics; Hidden Markov models; Source separation; Speech; Speech enhancement; Trajectory; non-negative matrix separation; spectral envelope; speech source separation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing, Communications and Computing (ICSPCC), 2011 IEEE International Conference on
  • Conference_Location
    Xi´an
  • Print_ISBN
    978-1-4577-0893-0
  • Type

    conf

  • DOI
    10.1109/ICSPCC.2011.6061795
  • Filename
    6061795