Title :
Empirical comparison of analog and digital auditory preprocessing for automatic speech recognition
Author :
Massengill, Todd M. ; Wilson, Denise M. ; Hasler, Paul E. ; Graham, David W.
Author_Institution :
Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
Abstract :
Results from digital and analog filter bank preprocessors are compared to establish the validity of analog processing for automatic speech recognition (ASR) systems. Three systems are evaluated on speaker- and context-independent phoneme recognition tasks. The three ASR systems are identical except for the preprocessing techniques used to derive three signal representations: (1) the digital mel-frequency spectrum, (2) the mel-frequency spectrum from commercial discrete bandpass filters, and (3) the exponential spectrum from an analog VLSI bandpass filter bank. The discrete analog system exhibits a 38% increase in recognition accuracy over the digital preprocessing technique. The digital and analog VLSI-based techniques perform comparably (within 3% of each other).
Keywords :
VLSI; signal representation; speech processing; speech recognition; auditory preprocessing; automatic speech recognition; context independent phoneme recognition; digital mel-frequency spectrum; exponential spectrum; filter bank preprocessors; preprocessing techniques; recognition accuracy; signal representations; speaker independent phoneme recognition; Automatic speech recognition; Biology computing; Digital signal processing; Electronic mail; Filter bank; Filtering theory; Neuromorphics; Speech processing; Speech recognition; Very large scale integration
Conference_Title :
Circuits and Systems, 2002. ISCAS 2002. IEEE International Symposium on
Print_ISBN :
0-7803-7448-7
DOI :
10.1109/ISCAS.2002.1010644