DocumentCode
697983
Title
Spectral covariance in prior distributions of non-negative matrix factorization based speech separation
Author
Virtanen, Tuomas
Author_Institution
Dept. of Signal Process., Tampere Univ. of Technol., Tampere, Finland
fYear
2009
fDate
24-28 Aug. 2009
Firstpage
1933
Lastpage
1937
Abstract
This paper proposes an algorithm for modeling the covariance of the spectrum in the prior distributions of non-negative matrix factorization (NMF) based sound source separation. Supervised NMF estimates a set of spectrum basis vectors for each source, and then represents a mixture signal using them. When the exact characteristics of the sources are not known in advance, it is advantageous to train prior distributions of spectra instead of fixed spectra. Since the frequency bands in natural sound sources are strongly correlated, we model the distributions with full-covariance Gaussian distributions. Algorithms for training and applying the distributions are presented. The proposed methods produce better separation quality that the reference methods. Demonstration signals are available at www.cs.tut.fi/~tuomasv.
Keywords
Gaussian distribution; matrix decomposition; source separation; frequency bands; full-covariance Gaussian distributions; mixture signal; natural sound sources; nonnegative matrix factorization; prior distributions; sound source separation; spectral covariance; spectrum basis vectors; speech separation; Speech; Stability analysis; Testing; Training; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2009 17th European
Conference_Location
Glasgow
Print_ISBN
978-161-7388-76-7
Type
conf
Filename
7077556
Link To Document