Title :
Keyword spotting based on the analysis of template matching distances
Author :
Barakat, M.S. ; Ritz, C.H. ; Stirling, D.A.
Author_Institution :
Sch. of Electr., Comput. & Telecommun. Eng., Univ. of Wollongong, Wollongong, NSW, Australia
Abstract :
This paper presents a system for speaker-independent keyword spotting (KWS) in continuous speech using a spoken example template. The approach, which uses Dynamic Time Warping (DTW) to match the template to a test utterance, requires none of the modelling or training needed by alternative techniques such as the Hidden Markov Model (HMM). This is particularly relevant to applications such as detecting words that are not adequately represented in a training database (e.g. searching for topical words that are emerging in society). The paper introduces the use of the DTW distance histogram to automatically estimate similarity thresholds for every keyword-utterance pair. Experiments conducted on a wide range of speech sentences and keywords show that, when only a few examples of the keyword are available, the proposed system has a higher recall ratio than an HMM-based approach.
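The template-matching idea in the abstract can be illustrated with a minimal sketch. It assumes that the keyword template and the test utterance are already sequences of feature vectors, that the local cost is Euclidean distance, and that the template is compared against fixed-length sliding windows. The abstract specifies none of these details, and the names `dtw_distance` and `sliding_dtw` are hypothetical. The histogram-based threshold estimation is not sketched because the abstract does not describe its rule.

```python
import math


def dtw_distance(template, segment):
    """Length-normalised DTW distance between two sequences of feature vectors.

    Assumptions (not taken from the paper): Euclidean local cost and the
    basic step pattern (insertion, deletion, match).
    """
    n, m = len(template), len(segment)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning template[:i] with segment[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(template[i - 1], segment[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # template frame repeated
                cost[i][j - 1],      # segment frame repeated
                cost[i - 1][j - 1],  # one-to-one match
            )
    return cost[n][m] / (n + m)


def sliding_dtw(template, utterance, hop=1):
    """DTW distance of the template against each window of the utterance.

    The window length equals the template length (an assumption); the DTW
    warping absorbs timing differences within each window.
    """
    win = len(template)
    return [
        dtw_distance(template, utterance[start:start + win])
        for start in range(0, len(utterance) - win + 1, hop)
    ]


# Toy one-dimensional "features": the keyword occurs at frames 2 to 4.
template = [(0.0,), (1.0,), (2.0,)]
utterance = [(5.0,), (5.0,), (0.0,), (1.0,), (2.0,), (5.0,)]
distances = sliding_dtw(template, utterance)
best_start = min(range(len(distances)), key=distances.__getitem__)
```

In a system of the kind described, the list of distances for a keyword-utterance pair would be the input from which a histogram is built and a similarity threshold is estimated; windows whose distance falls below that threshold would be reported as keyword hits.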
Keywords :
estimation theory; speech recognition; DTW distance histogram; KWS; continuous speech; dynamic time warping; keyword-utterance pair; similarity threshold estimation; speaker independent keyword spotting; spoken example template; template matching distance; Databases; Feature extraction; Hidden Markov models; Histograms; Speech; Training; Vectors; Automatic Speech Recognition; Dynamic Time Warping (DTW); Hidden Markov Model (HMM); Keyword Spotting;
Conference_Title :
Signal Processing and Communication Systems (ICSPCS), 2011 5th International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
978-1-4577-1179-4
Electronic_ISBN :
978-1-4577-1178-7
DOI :
10.1109/ICSPCS.2011.6140822