DocumentCode :
1253798
Title :
Upper and lower bounds on the mean of noisy speech: application to minimax classification
Author :
Afify, Mohamed ; Siohan, Olivier ; Lee, Chin-Hui
Author_Institution :
Multimedia Commun. Res. Lab., Lucent Technol. Bell Labs., Murray Hill, NJ, USA
Volume :
10
Issue :
2
fYear :
2002
fDate :
2/1/2002 12:00:00 AM
Firstpage :
79
Lastpage :
88
Abstract :
In this paper, we derive upper and lower bounds on the mean of speech corrupted by additive noise. The bounds are derived in the log spectral domain. Also approximate bounds on the first and second order time derivatives are developed. It is also shown how to transform these bounds to the mel frequency cepstral coefficient (MFCC) domain. The proposed bounds are used to define the mismatch neighborhood for minimax classification. It is shown that this parametric neighborhood works quite well for artificially added noise and for a real-life mismatch scenario (moving car environment) which does not fully conform with the theoretical conditions used to derive the bounds. In contrast to traditional neighborhood structure for minimax classification, no empirical tuning of the bounds is required. It is believed that the applicability of the derived bounds is not limited to a minimax setting and can be potentially used to develop various compensation scenarios in the log spectral domain
Keywords :
acoustic noise; cepstral analysis; minimax techniques; pattern classification; spectral-domain analysis; speech recognition; MFCQ domain; additive noise; compensation scenarios; log spectral domain; lower bounds; mel frequency cepstral coefficient domain; minimax classification; mismatch neighborhood; moving car environment; noisy speech; parametric neighborhood; real-life mismatch scenario; time derivatives; upper bounds; Additive noise; Distortion measurement; Mel frequency cepstral coefficient; Minimax techniques; Noise robustness; Speech enhancement; Speech recognition; Statistics; Uncertainty; Working environment noise;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.985545
Filename :
985545
Link To Document :
بازگشت