DocumentCode
1060270
Title
Audible Noise Reduction in Eigendomain for Speech Enhancement
Author
You, Chang Huai ; Rahardja, Susanto ; Koh, Soo Ngee
Author_Institution
Inst. for Infocomm Res., Singapore
Volume
15
Issue
6
fYear
2007
Firstpage
1753
Lastpage
1765
Abstract
A signal subspace scheme based on masking properties is proposed for enhancement of speech degraded by additive noise. Since the masking properties are related to the critical frequency band that is derived from the characteristics of human cochlea, the incorporation of masking threshold into a subspace technique requires the transformation between the frequency and eigen domains. We present and apply an invertible transformation between the frequency and eigen domains. In this paper, we use masking properties of the human auditory system to define the audible noise quantity in the eigendomain. We derive the eigen-decomposition of the estimated speech autocorrelation matrix with the assumption of white noise. Subsequently, an audible noise reduction scheme is developed based on a signal subspace technique, and the implementation of our proposed scheme is outlined. We further extend the scheme to the colored noise case. Simulation results show the superiority of our proposed scheme over other existing subspace methods in terms of segmental signal-to-noise ratio (SNR), perceptual evaluation of speech quality (PESQ), modified Bark spectral distortion (MBSD), spectrogram and informal listening tests.
Keywords
matrix algebra; speech enhancement; speech intelligibility; additive noise; audible noise reduction; human auditory system; human cochlea; invertible transformation; masking threshold; modified Bark spectral distortion; perceptual evaluation of speech quality; signal subspace technique; signal-to-noise ratio; speech autocorrelation matrix; speech enhancement eigendomain; Acoustic noise; Additive noise; Auditory system; Degradation; Frequency; Humans; Masking threshold; Signal to noise ratio; Speech coding; Speech enhancement; Audible noise reduction; eigen-decomposition; masking properties; signal subspace; speech enhancement;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2007.899288
Filename
4276768
Link To Document