Title :
Modeling Human Auditory Perception for Noise-Robust Speech Recognition
Author_Institution :
Dept. of BioSystems, Korea Adv. Inst. of Sci. & Technol., Daejeon
Abstract :
Several bio-inspired models of human auditory perception are reported for robust speech recognition in real-world noisy environment. The developed mathematical models of the human auditory pathway are integrated into a speech recognition system, of which 3 components are (1) the nonlinear feature extraction model from cochlea to auditory cortex, (2) the binaural processing model at superior olivery complex, and (3) the top-down attention model from higher brain to the cochlea. The unsupervised independent component analysis shows that some auditory feature extraction and binaural processing mechanisms follow information theory with sparse representation. The ICA-based features resemble frequency-limited features extracted from the cochlea and also more complex time-frequency features from the inferior colliculus and auditory cortex. The top-down attention model shows how the pre-acquired knowledge in our brain filters out irrelevant features or fills in missing features in the sensory data. Both the top-down attention and bottom-up binaural processing are combined into a single system for high-noisy cases. This auditory model requires extensive computing, and several VLSI implementations had been developed for real-time applications. Experimental results demonstrate much better recognition performance in realworld noisy environments
Keywords :
feature extraction; hearing; independent component analysis; speech recognition; auditory cortex; binaural processing model; cochlea; human auditory pathway; human auditory perception; noise-robust speech recognition; nonlinear feature extraction model; superior olivery complex; top-down attention model; unsupervised independent component analysis; Brain modeling; Feature extraction; Frequency; Humans; Independent component analysis; Information theory; Mathematical model; Noise robustness; Speech recognition; Working environment noise;
Conference_Titel :
Neural Networks and Brain, 2005. ICNN&B '05. International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7803-9422-4
DOI :
10.1109/ICNNB.2005.1614867