DocumentCode :
3010584
Title :
Point process models of spectro-temporal modulation events for speech recognition
Author :
Jansen, Aren ; Mesgarani, Nima ; Niyogi, Partha
fYear :
2010
fDate :
7-10 Nov. 2010
Firstpage :
104
Lastpage :
108
Abstract :
Neurobiological research has uncovered the existence of cortical neurons in various animal species tuned to particular spectro-temporal modulations (STM) in the auditory stimulus. Other findings indicate that temporal statistics of the resulting neural spike trains may encode the underlying content of species-specific communication calls. With this motivation, we present an alternative approach to speech recognition based on point process statistical models of the local maxima events produced by a cortically-inspired spectro-temporal filter bank. We demonstrate the computational adequacy of this approach on the practical task of keyword spotting.
Keywords :
speech recognition; statistics; animal species; auditory stimulus; cortical neurons; cortically-inspired spectro-temporal filter bank; neural spike trains; neurobiological research; point process models; species-specific communication; spectro-temporal modulation events; spectro-temporal modulations; speech recognition; temporal statistics; Acoustics; Detectors; Hidden Markov models; Modulation; Speech; Speech recognition; Training; point process models; spectro-temporal modulation features; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signals, Systems and Computers (ASILOMAR), 2010 Conference Record of the Forty Fourth Asilomar Conference on
Conference_Location :
Pacific Grove, CA
ISSN :
1058-6393
Print_ISBN :
978-1-4244-9722-5
Type :
conf
DOI :
10.1109/ACSSC.2010.5757477
Filename :
5757477
Link To Document :
بازگشت