DocumentCode :
35496
Title :
Spatial Source Subtraction Based on Incomplete Measurements of Relative Transfer Function
Author :
Koldovsky, Zbynek ; Malek, Jiri ; Gannot, Sharon
Author_Institution :
Fac. of Mechatron., Inf., & Interdiscipl. Studies, Tech. Univ. of Liberec, Liberec, Czech Republic
Volume :
23
Issue :
8
fYear :
2015
fDate :
Aug. 2015
Firstpage :
1335
Lastpage :
1347
Abstract :
Relative impulse responses between microphones are usually long and dense due to the reverberant acoustic environment. Estimating them from short and noisy recordings poses a long-standing challenge of audio signal processing. In this paper, we apply a novel strategy based on ideas of compressed sensing. Relative transfer function (RTF) corresponding to the relative impulse response can often be estimated accurately from noisy data but only for certain frequencies. This means that often only an incomplete measurement of the RTF is available. A complete RTF estimate can be obtained through finding its sparsest representation in the time-domain: that is, through computing the sparsest among the corresponding relative impulse responses. Based on this approach, we propose to estimate the RTF from noisy data in three steps. First, the RTF is estimated using any conventional method such as the nonstationarity-based estimator by Gannot or through blind source separation. Second, frequencies are determined for which the RTF estimate appears to be accurate. Third, the RTF is reconstructed through solving a weighted l1 convex program, which we propose to solve via a computationally efficient variant of the SpaRSA (Sparse Reconstruction by Separable Approximation) algorithm. An extensive experimental study with real-world recordings has been conducted. It has been shown that the proposed method is capable of improving many conventional estimators used as the first step in most situations.
Keywords :
audio signal processing; blind source separation; compressed sensing; convex programming; microphones; signal reconstruction; signal representation; time-domain analysis; transfer functions; RTF reconstruction; SpaRSA; audio signal processing; blind source separation; compressed sensing; microphone; relative impulse response; relative transfer function incomplete measurement; reverberant acoustic environment; sparse reconstruction by separable approximation algorithm; spatial source subtraction; time-domain; weighted l1 convex program; Approximation methods; Frequency estimation; Frequency-domain analysis; Microphones; Noise; Noise measurement; Noise reduction; ${ell _1}$ norm; Compressed sensing; relative impulse response; relative transfer function (RTF); sparse approximations;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
2329-9290
Type :
jour
DOI :
10.1109/TASLP.2015.2425213
Filename :
7090956
Link To Document :
بازگشت