Model-based non-negative matrix factorization for single-channel speech separation

Author

Zheng, Nengheng ; Lee, Tan ; Mak, Chun-Man

Author_Institution

Coll. of Inf. Engineergin, Shenzhen Univ., Shenzhen, China

fYear

2011

fDate

14-16 Sept. 2011

Firstpage

1

Lastpage

4

Abstract

A model-based non-negative matrix factorization (NMF) algorithm is formulated for single-channel speech source separation. With linguistic priors of the speech sources, the state-aligned spectral envelopes of these sources are inferred from acoustic models. Being initialized with the computed spectral envelopes, NMF processing is implemented to estimate the spectral envelope trajectory of each source. Subsequently the source speech is generated by reshaping the mixture spectra according to the estimated spectral envelopes, followed by the reduction of interfering harmonic components. An iterative process is developed for more reliable time alignment and hence better performance of separation. Experimental results show that two speech sources with equal intensity can be successfully separated by the proposed algorithm.

Keywords

matrix decomposition; source separation; speech processing; acoustic models; model based nonnegative matrix factorization; single channel speech separation; spectral envelope estimation; spectral envelope trajectory; Acoustics; Hidden Markov models; Source separation; Speech; Speech enhancement; Trajectory; non-negative matrix separation; spectral envelope; speech source separation;

fLanguage

English

Publisher

ieee

Conference_Titel

Signal Processing, Communications and Computing (ICSPCC), 2011 IEEE International Conference on

Conference_Location

Xi´an

Print_ISBN

978-1-4577-0893-0

Type

conf

DOI

10.1109/ICSPCC.2011.6061795

Filename

6061795