DocumentCode :
2996560
Title :
A family of distortion measures base upon projection operation for robust speech recognition
Author :
Mansour, David ; Juang, Biing Hwang
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
fYear :
1988
fDate :
11-14 Apr 1988
Firstpage :
36
Abstract :
The authors aim at the formulation of similarity measures for robust speech recognition. Their consideration focuses on the speech cepstrum derived from linear prediction coefficients (the LPC cepstrum). By using common models for noisy speech, they analytically and empirically show how the ambient noise can affect some important attributes of the LPC cepstrum such as the vector norm, coefficient order, and the direction perturbation. The new findings led them to propose a family of distortion measures based on the projection between two cepstral vectors. Performance evaluation of these measures has been conducted in both speaker-dependent and speaker-independent isolated word recognition tasks. Experimental results show that the new measures cause no degradation in recognition accuracy at high SNR, but perform significantly better when tested under noisy conditions using only clean reference templates. At an SNR of 5 dB, the new measures are shown to be able to achieve a recognition rate equivalent to that obtained by the filtered cepstral measure at 20 dB SNR, demonstrating a gain of 15 dB
Keywords :
noise; speech recognition; 15 dB; LPC cepstrum; SNR; ambient noise; cepstral vectors; coefficient order; degradation; direction perturbation; distortion measures; linear prediction coefficients; noisy speech; performance evaluation; projection operation; recognition accuracy; recognition rate; robust speech recognition; similarity measures; speaker dependent recognition; speaker-independent isolated word recognition; speech cepstrum; vector norm; Cepstral analysis; Cepstrum; Distortion measurement; Gain measurement; Linear predictive coding; Noise robustness; Speech analysis; Speech enhancement; Speech recognition; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on
Conference_Location :
New York, NY
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.1988.196503
Filename :
196503
Link To Document :
بازگشت