Title :
Perceptual Properties of Current Speech Recognition Technology
Author :
Hermansky, Hynek ; Cohen, J.R. ; Stern, Richard M.
Author_Institution :
Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
In recent years, a number of feature extraction procedures for automatic speech recognition (ASR) systems have been based on models of human auditory processing, and one often hears arguments in favor of implementing knowledge of human auditory perception and cognition into machines for ASR. This paper takes a reverse route, and argues that the engineering techniques for automatic recognition of speech that are already in widespread use are often consistent with some well-known properties of the human auditory system.
Keywords :
feature extraction; hearing; speech recognition; ASR systems; automatic speech recognition systems; engineering techniques; feature extraction procedures; human auditory perception; human auditory processing; human auditory system; perceptual property; reverse route; speech recognition technology; Auditory system; Cavity resonators; Educational institutions; Feature extraction; Resonant frequency; Speech recognition; Auditory perception; feature extraction; speech recognition;
Journal_Title :
Proceedings of the IEEE
DOI :
10.1109/JPROC.2013.2252316