Title :
Robust word recognition using threaded spectral peaks
Author :
Strope, B. ; Alwan, A.
Author_Institution :
Dept. of Electr. Eng., California Univ., Los Angeles, CA, USA
Abstract :
A novel technique which characterizes the position and motion of dominant spectral peaks in speech, significantly reduces the error-rate of an HMM-based word-recognition system. The technique includes approximate auditory filtering, temporal adaptation, identification of local spectral peaks in each frame, grouping of neighboring peaks into threads, estimation of frequency derivatives, and slowly updating approximations of the threads and their derivatives. This processing provides a frame-based speech representation which is both dependent on perceptually salient aspects of the frame´s immediate context, and is well-suited to segmentally-stationary statistical characterization. In noise, the representation reduces the error-rate obtained with standard Mel-filter-based feature vectors by as much as a factor of 4, and provides improvements over other common feature-vector manipulations
Keywords :
error statistics; feature extraction; filtering theory; hearing; hidden Markov models; signal representation; spectral analysis; speech processing; speech recognition; HMM-based word-recognition system; Mel-filter-based feature vectors; approximate auditory filtering; error-rate reduction; frame-based speech representation; frequency derivatives estimation; local spectral peaks identification; robust word recognition; temporal adaptation; threaded spectral peaks; Automatic speech recognition; Bandwidth; Filtering; Frequency estimation; Motion estimation; Robustness; Speech analysis; Speech processing; Speech recognition; Yarn;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.675342