Title :
Representation of Phonemes in Primary Auditory Cortex: How the Brain Analyzes Speech
Author :
Mesgarani, N. ; David, Stoppa ; Shamma, Sadia
Author_Institution :
Dept. of Electr. & Comput. Eng., Maryland Univ., College Park, MD, USA
Abstract :
Many transformations inspired by the auditory system have improved the performance of automatic speech recognition (ASR) systems. However, humans perform substantially better than today´s ASR systems, suggesting that ASR systems can further benefit from understanding how the brain represents speech. To learn about the cortical representation of speech, we measured the neural responses in the primary auditory cortex to sentences from the TIMIT database. Here we examine how individual phonemes activate different subsets of auditory neurons, reflecting the diversity of neural tuning properties. We find that neurons with different spectro-temporal tuning provide an explicit multidimensional representation of articulatory features independent of speaker and context. This representation that matches the human perception could provide a framework for ASR in adverse conditions.
Keywords :
auditory evoked potentials; brain; neurophysiology; speech processing; speech recognition; TIMIT database; articulatory features; auditory system; automatic speech recognition; multidimensional representation; neural responses; neural tuning properties; phonemes representation; primary auditory cortex; spectro-temporal tuning; Auditory system; Automatic speech recognition; Frequency; Humans; Multidimensional systems; Neurons; Robustness; Speech analysis; Speech recognition; Tuning; Auditory System; speech processing; speech recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.367025