Title :
A Temporal Auditory Model with Adaptation for Automatic Speech Recognition
Author :
Haque, Showera ; Togneri, Roberto ; Zaknich, A.
Author_Institution :
Sch. of Electr., Electron. & Comput. Eng., Western Australia Univ., Nedlands, WA, Australia
Abstract :
Rapid and short-term adaptation are dynamic mechanisms of human auditory system. An auditory model based on zero-crossings with peak amplitudes (ZCPA) was used as a front-end for automatic speech recognition (ASR) with the perceptual property of adaptation as determined by psychoacoustic observations. The model performance was evaluated on the isolated digits (TIDIGITS) database using continuous density HMM recognizer in additive noise environment. Experimental results indicate that the ASR performance of the ZCPA may be improved with adaptation over the static baseline performance in white Gaussian and factory noise. The perceptual front-end was also evaluated with dynamic (delta and delta-delta) features added to the adaptation. It was observed that adaptation with dynamic features performed better in factory, babble and car noise over a wide range of SNR values.
Keywords :
AWGN; hidden Markov models; speech recognition; SNR; TIDIGITS; additive noise environment; automatic speech recognition; car noise; continuous density HMM recognizer; delta-delta features; factory noise; human auditory system; isolated digits database; peak amplitudes; psychoacoustic observations; temporal auditory model; white Gaussian noise; zero-crossings; Adaptation model; Additive noise; Auditory system; Automatic speech recognition; Hidden Markov models; Humans; Production facilities; Psychoacoustic models; Psychology; Spatial databases; Auditory system; adaptive system; feature extraction; hidden Markov model; speech recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.367276