Title :
Small-footprint keyword spotting using deep neural networks
Author :
Guoguo Chen ; Parada, Carlos ; Heigold, Georg
Author_Institution :
Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
Our application requires a keyword spotting system with a small memory footprint, low computational cost, and high precision. To meet these requirements, we propose a simple approach based on deep neural networks. A deep neural network is trained to directly predict the keyword(s) or subword units of the keyword(s) followed by a posterior handling method producing a final confidence score. Keyword recognition results achieve 45% relative improvement with respect to a competitive Hidden Markov Model-based system, while performance in the presence of babble noise shows 39% relative improvement.
Keywords :
hidden Markov models; neural nets; speech recognition; telecommunication computing; babble noise; confidence score; deep neural networks; hidden Markov model; high precision; keyword prediction; keyword recognition; low computational cost; posterior handling method; small memory footprint; small-footprint keyword spotting; subword unit prediction; Acoustics; Computational modeling; Hidden Markov models; Neural networks; Speech; Speech processing; Training; Deep Neural Network; Embedded Speech Recognition; Keyword Spotting;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854370