DocumentCode
2069784
Title
Model-based non-negative matrix factorization for single-channel speech separation
Author
Zheng, Nengheng ; Lee, Tan ; Mak, Chun-Man
Author_Institution
Coll. of Inf. Engineergin, Shenzhen Univ., Shenzhen, China
fYear
2011
fDate
14-16 Sept. 2011
Firstpage
1
Lastpage
4
Abstract
A model-based non-negative matrix factorization (NMF) algorithm is formulated for single-channel speech source separation. With linguistic priors of the speech sources, the state-aligned spectral envelopes of these sources are inferred from acoustic models. Being initialized with the computed spectral envelopes, NMF processing is implemented to estimate the spectral envelope trajectory of each source. Subsequently the source speech is generated by reshaping the mixture spectra according to the estimated spectral envelopes, followed by the reduction of interfering harmonic components. An iterative process is developed for more reliable time alignment and hence better performance of separation. Experimental results show that two speech sources with equal intensity can be successfully separated by the proposed algorithm.
Keywords
matrix decomposition; source separation; speech processing; acoustic models; model based nonnegative matrix factorization; single channel speech separation; spectral envelope estimation; spectral envelope trajectory; Acoustics; Hidden Markov models; Source separation; Speech; Speech enhancement; Trajectory; non-negative matrix separation; spectral envelope; speech source separation;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing, Communications and Computing (ICSPCC), 2011 IEEE International Conference on
Conference_Location
Xi´an
Print_ISBN
978-1-4577-0893-0
Type
conf
DOI
10.1109/ICSPCC.2011.6061795
Filename
6061795
Link To Document