Title :
A Soft Masking Strategy Based on Multichannel Speech Probability Estimation for Source Separation and Robust Speech Recognition
Author :
Hoffmann, Eugen ; Kolossa, Dorothea ; Orglmeister, Reinhold
Author_Institution :
Berlin University of Technology, Electronics and Medical Signal Processing Group, Einsteinufer 17, 10587 Berlin, Germany. eugen.hoffmann.1@tu-berlin.de
Abstract :
In this paper, we present a post-processing algorithm that improves the output quality of ICA algorithms by applying a modified speech enhancement technique. The proposed method estimates speech probabilities from the ICA outputs by means of two-dimensional correlations. With these probabilities, a soft masking function can be applied to the ICA outputs, which significantly increases interferer suppression. To avoid negative effects on subsequent speech recognition, missing-feature recognition is applied to robustly recognize the non-linearly processed speech signal. The algorithm has been tested on real-room speech mixtures with a reverberation time of 300 ms, where an SIR improvement of up to 32 dB was obtained, 10 dB above the performance of ICA alone on the same dataset.
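As a rough illustration of the soft-masking idea only: the abstract does not specify how the two-dimensional correlations yield speech probabilities, so the sketch below substitutes a simple logistic function of the log-magnitude ratio between two ICA outputs. The function names `soft_mask` and `apply_mask`, the `slope` parameter, and the clamping range are all assumptions made for this example, not the authors' method.

```python
import math

def soft_mask(target_mag, interferer_mag, slope=0.5, eps=1e-12):
    """Return a per-bin mask in [0, 1] for a magnitude spectrogram.

    Assumption: the speech probability of each time-frequency bin is
    approximated by a logistic function of the ratio (in dB) between
    the target and interferer ICA output magnitudes. This stands in for
    the correlation-based estimate, which the abstract does not detail.
    """
    mask = []
    for t_row, i_row in zip(target_mag, interferer_mag):
        row = []
        for t, i in zip(t_row, i_row):
            ratio_db = 20.0 * math.log10((t + eps) / (i + eps))
            # Clamp the exponent so math.exp cannot overflow.
            x = max(-50.0, min(50.0, slope * ratio_db))
            row.append(1.0 / (1.0 + math.exp(-x)))
        mask.append(row)
    return mask

def apply_mask(mag, mask):
    """Scale each time-frequency bin by its mask value."""
    return [[m * g for m, g in zip(m_row, g_row)]
            for m_row, g_row in zip(mag, mask)]

# Toy example: one frame, two frequency bins (hypothetical values).
target = [[1.0, 0.1]]
interferer = [[0.1, 1.0]]
mask = soft_mask(target, interferer)
masked = apply_mask(target, mask)
```

Bins where the target output dominates keep a mask value near 1, while bins dominated by the other output are attenuated towards 0. Unlike a binary mask, intermediate values are retained where the evidence is ambiguous.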
Keywords :
Independent component analysis; Reverberation; Robustness; Signal processing; Source separation; Speech coding; Speech enhancement; Speech processing; Speech recognition; Testing;
Conference_Title :
Applications of Signal Processing to Audio and Acoustics, 2007 IEEE Workshop on
Conference_Location :
New Paltz, NY, USA
Print_ISBN :
978-1-4244-1620-2
Electronic_ISBN :
978-1-4244-1619-6
DOI :
10.1109/ASPAA.2007.4393002