Title :
Unified approach for underdetermined BSS, VAD, dereverberation and DOA estimation with multichannel factorial HMM
Author :
Higuchi, Takuya ; Kameoka, Hirokazu
Author_Institution :
Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
Abstract :
This paper proposes a novel method for simultaneously solving the problems of underdetermined blind source separation (BSS), source activity detection, dereverberation and direction-of-arrival (DOA) estimation by introducing an extension of the "multichannel factorial hidden Markov model (MFHMM)." The MFHMM is an extension of the multichannel non-negative matrix factorization (NMF) model in which the basis spectra are allowed to vary over time according to the transitions of the hidden states. This model has allowed us to perform source separation, source activity detection and dereverberation in a unified manner. In our previous model, the spatial covariance of each source was treated as a model parameter. This gave the entire generative model an unnecessarily high degree of freedom, and the parameter inference was therefore prone to getting trapped in undesired local optima. To reasonably restrict the solution space of the spatial covariance matrix of each source, we propose to describe it as a weighted sum of fixed spatial covariance matrices, each corresponding to one of a discrete set of DOAs. Through parameter inference, the proposed model allows us to simultaneously solve the problems of underdetermined BSS, source activity detection, dereverberation and DOA estimation. Experimental results revealed that the proposed method was superior to a previous method in terms of the signal-to-distortion ratios of the separated signals.
Keywords :
blind source separation; covariance matrices; direction-of-arrival estimation; hidden Markov models; inference mechanisms; matrix decomposition; reverberation; BSS; DOA estimation; MFHMM; NMF model; VAD; dereverberation; fixed spatial covariance matrix; hidden state transition; multichannel factorial HMM; multichannel factorial hidden Markov model; multichannel nonnegative matrix factorization model; parameter inference; signal separation; signal-to-distortion ratio; source activity detection; underdetermined blind source separation; Correlation; Microphones; Source separation; Speech processing; Time-frequency analysis; DOA; hidden Markov model; non-negative matrix factorization
Conference_Title :
Signal and Information Processing (GlobalSIP), 2014 IEEE Global Conference on
Conference_Location :
Atlanta, GA
DOI :
10.1109/GlobalSIP.2014.7032180