DocumentCode :
1652119
Title :
Blind estimation of reverberation time based on spectro-temporal modulation filtering
Author :
Feifei Xiong ; Goetze, Stefan ; Meyer, Bernd T.
Author_Institution :
Project Group Hearing-, Speech- & Audio-Technol. (HSA), Fraunhofer Inst. for Digital Media Technol. (IDMT), Oldenburg, Germany
fYear :
2013
Firstpage :
443
Lastpage :
447
Abstract :
A novel method for blind estimation of the reverberation time (RT60) is proposed based on applying spectro-temporal modulation filters to time-frequency representations. 2D-Gabor filters arranged in a filterbank enable an analysis of the properties of temporal, spectral, and spectro-temporal filtering for this task. Features are used as input to a multi-layer perceptron (MLP) classifier combined with a simple decision rule that attributes a specific RT60 to a given utterance and allows to assess the reliability of the approach for different resolutions of RT60 classification. While the filter set including temporal, spectral, and spectro-temporal filters already outperforms an MFCC baseline, the error rates are further reduced when relying on diagonal spectro-temporal filters alone. The average error rate is 1.9% for the best feature set, which corresponds to a relative reduction of 58.3% compared to the MFCC baseline for RT60s in 0.1 s resolution.
Keywords :
Gabor filters; blind source separation; filtering theory; multilayer perceptrons; reverberation; signal classification; time-frequency analysis; 2D-Gabor filters; MFCC baseline; MLP classifier; RT60 classification; blind estimation; decision rule; diagonal spectro-temporal filters; error rates; filterbank; multilayer perceptron classifier; relative reduction; reliability; reverberation time; spectral filtering; spectro-temporal filtering; spectro-temporal modulation filtering; spectro-temporal modulation filters; time-frequency representations; Estimation; Frequency modulation; Mel frequency cepstral coefficient; Reverberation; Speech; 2D Gabor filterbank; Blind reverberation time estimation; spectro-temporal modulation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6637686
Filename :
6637686
Link To Document :
بازگشت