Title :
Single channel speech enhancement based on masking properties of the human auditory system
Author_Institution :
Signal Process. Lab., Swiss Fed. Inst. of Technol., Lausanne, Switzerland
Date :
3/1/1999
Abstract :
This paper addresses the problem of single channel speech enhancement at very low signal-to-noise ratios (SNRs below 10 dB). The proposed approach is based on the introduction of an auditory model in a subtractive-type enhancement process. Single channel subtractive-type algorithms are characterized by a tradeoff between the amount of noise reduction, the speech distortion, and the level of musical residual noise, which can be modified by varying the subtraction parameters. Classical algorithms are usually limited to the use of fixed optimized parameters, which are difficult to choose for all speech and noise conditions. A new computationally efficient algorithm is developed based on masking properties of the human auditory system. It allows for an automatic adaptation in time and frequency of the parametric enhancement system, and finds the best tradeoff based on a criterion correlated with perception. This leads to a significant reduction of the unnatural structure of the residual noise. Objective and subjective evaluations of the proposed system are performed with several noise types from the Noisex-92 database, having different time-frequency distributions. The application of objective measures, the study of the speech spectrograms, and subjective listening tests confirm that the enhanced speech is more pleasant to a human listener. Finally, the proposed enhancement algorithm is tested as a front-end processor for speech recognition in noise, yielding improved results over classical subtractive-type algorithms.
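The abstract gives no formulas, so the following is only a minimal sketch of the general idea: a subtractive-type enhancer whose parameters vary per frequency bin according to a masking threshold. It assumes power-spectral subtraction with an over-subtraction factor `alpha` and a spectral floor `beta` tied to the noise estimate, and it assumes a simple linear mapping from the threshold to each parameter. The function names, the parameter ranges, and the mapping are hypothetical illustrations, not the paper's actual rules, and the masking threshold itself is taken as an input rather than computed from an auditory model.

```python
def adapt_parameter(threshold, t_min, t_max, p_min, p_max):
    """Map a masking threshold to a subtraction parameter (assumed linear rule).

    A low threshold (residual noise is audible) gives p_max;
    a high threshold (residual noise is masked) gives p_min.
    """
    if t_max <= t_min:
        return p_max  # flat threshold: fall back to the most aggressive setting
    frac = (t_max - threshold) / (t_max - t_min)
    frac = min(1.0, max(0.0, frac))
    return p_min + frac * (p_max - p_min)


def subtract_frame(noisy_power, noise_power, masking_threshold,
                   alpha_range=(1.0, 6.0), beta_range=(0.0, 0.02)):
    """Enhance one frame of power-spectrum values, bin by bin.

    noisy_power, noise_power, masking_threshold: equal-length sequences of
    floats, one value per frequency bin. The default ranges are illustrative
    assumptions, not values taken from the paper.
    """
    t_min, t_max = min(masking_threshold), max(masking_threshold)
    enhanced = []
    for y, d, t in zip(noisy_power, noise_power, masking_threshold):
        alpha = adapt_parameter(t, t_min, t_max, *alpha_range)
        beta = adapt_parameter(t, t_min, t_max, *beta_range)
        residual = y - alpha * d   # over-subtracted noise estimate
        floor = beta * d           # spectral floor limits musical noise
        enhanced.append(max(residual, floor))
    return enhanced


# Usage: two bins with equal noise but different masking thresholds.
frame = subtract_frame([10.0, 10.0], [2.0, 2.0], [0.0, 1.0])
```

In this sketch the bin with the lower masking threshold is subtracted more aggressively and held at the floor, while the bin where noise is assumed to be masked is left almost untouched, which is the kind of time-frequency tradeoff the abstract describes.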
Keywords :
hearing; noise; spectral analysis; speech enhancement; speech recognition; Noisex-92 database; auditory model; automatic adaptation; computationally efficient algorithm; fixed optimized parameters; front-end processor; human auditory system; masking properties; musical residual noise; noise reduction; objective evaluation; objective measures; parametric enhancement system; residual noise; signal-to-noise ratio; single channel speech enhancement; speech distortion; speech spectrograms; subjective listening tests; subtraction parameters; subtractive-type enhancement; time-frequency distributions; very low SNR; Auditory system; Databases; Frequency; Humans; Noise level; Noise reduction; Performance evaluation; Signal to noise ratio; Speech enhancement; Testing
Journal_Title :
Speech and Audio Processing, IEEE Transactions on