Title :
Fast keyword detection with sparse time-frequency models
Author :
Kokiopoulou, Effrosyni ; Frossard, Pascal ; Verscheure, Olivier
Author_Institution :
Signal Process. Inst. - ITS, Ecole Polytech. Fed. de Lausanne (EPFL), Lausanne
fDate :
June 23 2008-April 26 2008
Abstract :
We address the problem of keyword spotting in continuous speech streams when training and testing conditions can be different. We propose a keyword spotting algorithm based on sparse representation of speech signals in a time-frequency feature space. The training speech elements are jointly represented in a common subspace built on simple basis functions. The subspace is trained in order to capture the common time-frequency structures from different occurrences of the keywords to be spotted. The keyword spotting algorithm then employs a sliding window mechanism on speech streams. It computes the contribution of successive speech segments in the subspace of interest and evaluates the similarity with the training data. Experimental results on the TIMIT database show the effectiveness and the noise resilience of the low complexity spotting algorithm.
Keywords :
signal representation; speech processing; time-frequency analysis; TIMIT database; continuous speech streams; keyword detection; keyword spotting algorithm; speech signal representation; time-frequency models; Acoustic noise; Acoustic signal detection; Acoustic testing; Noise reduction; Pattern matching; Signal processing algorithms; Speech enhancement; Speech recognition; Time frequency analysis; Working environment noise; Keyword spotting; sparse representations;
Conference_Titel :
Multimedia and Expo, 2008 IEEE International Conference on
Conference_Location :
Hannover
Print_ISBN :
978-1-4244-2570-9
Electronic_ISBN :
978-1-4244-2571-6
DOI :
10.1109/ICME.2008.4607626