Title :
Robust speech recognition using features based on zero crossings with peak amplitudes
Author :
Gajic, B. ; Paliwal, Kuldip K.
Author_Institution :
Dept. of Telecommun., Norwegian Univ. of Sci. & Technol., Trondheim, Norway
Abstract :
The paper presents an extensive study of zero crossings with peak amplitudes (ZCPA) features, that have earlier been shown to outperform both conventional and auditory-based features in the presence of additive noise. The study starts by optimizing different parameters involved in ZCPA feature computation, followed by a comparison of ZCPA and MFCC features on two recognition tasks in different background conditions. The main differences between the two feature types are identified, and their individual effects on ASR performance are evaluated. The importance of a proper choice of analysis frame lengths and filter bandwidths in ZCPA feature extraction is demonstrated. Furthermore, the use of dominant frequency information in ZCPA features is found to be a major reason for increased robustness of ZCPA features compared to MFCC features.
Keywords :
acoustic noise; feature extraction; optimisation; random noise; speech recognition; MFCC features; additive noise; feature extraction; robust speech recognition; zero crossings with peak amplitudes features; Automatic speech recognition; Background noise; Frequency estimation; Gaussian noise; Histograms; Mel frequency cepstral coefficient; Noise robustness; Spatial databases; Speech recognition; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198717