DocumentCode :
1252142
Title :
Speech analysis and recognition using interval statistics generated from a composite auditory model
Author :
Sheikhzadeh, H. ; Deng, L.
Author_Institution :
Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada
Volume :
6
Issue :
1
fYear :
1998
fDate :
1/1/1998 12:00:00 AM
Firstpage :
90
Lastpage :
94
Abstract :
A modeling approach to auditory speech analysis and recognition is proposed and evaluated, where a composite auditory model is used to generate parallel sets of auditory-nerve instantaneous firing rates (IFRs) along the spatial dimension, followed by a processing stage that constructs from the IFRs the interval statistics in a form called the interpeak interval histogram (IPIH). A speech preprocessor is designed that performs transformation on the auditory IPIHs and interfaces the IPIH-based auditory representation with a hidden Markov model-based (HMM-based) speech recognizer. The results demonstrate that the new preprocessor consistently outperforms the conventional mel frequency cepstral coefficient-based (MFCC-based) preprocessor for the signal-to-noise ratio (SNR) level up to at least 16 dB
Keywords :
acoustic signal processing; hearing; hidden Markov models; speech processing; speech recognition; statistical analysis; HMM-based speech recognizer; SNR; auditory representation; auditory speech analysis; auditory-nerve instantaneous firing rates; composite auditory model; hidden Markov model; interpeak interval histogram; interval statistics; mel frequency cepstral coefficient; processing stage; signal-to-noise ratio; spatial dimension; speech preprocessor; Acoustic beams; Acoustic signal processing; Computational complexity; Hidden Markov models; Natural languages; Neural networks; Pattern recognition; Speech analysis; Speech processing; Speech recognition;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.650316
Filename :
650316
Link To Document :
بازگشت