Title :
An algorithm for spoken keyword spotting via subsequence DTW
Author :
Hongyu Guo ; Dongmei Huang ; Xiaoqun Zhao
Author_Institution :
Inf. Coll., Shanghai Ocean Univ., Shanghai, China
Abstract :
We present an algorithm for spoken keyword spotting in spoken documents using subsequence Dynamic Time Warping (DTW). Instead of using word or phone strings as query terms, we use the user's spoken utterances as queries. Query matches in the test data are located by running subsequence DTW between the query templates and the reference spoken documents. Subsequence DTW is a variant of the DTW technique designed to find multiple similar subsequences between two templates. We introduce subsequence DTW into spoken keyword spotting to enable keyword spotting in low-resource situations, where no in-domain training material is needed. Experiments with this approach on the TIMIT corpus are presented.
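As an illustration of the subsequence DTW matching mentioned in the abstract, the following is a minimal sketch of the standard textbook formulation, not the authors' implementation. The function name `subsequence_dtw`, the scalar "frames", and the absolute-difference distance are assumptions made for the example; the abstract does not state which features or local distance the paper uses. The sketch returns only the single best match, whereas the abstract describes locating multiple matching subsequences.

```python
def subsequence_dtw(query, reference, dist=lambda a, b: abs(a - b)):
    """Find the span of `reference` that best aligns with all of `query`.

    Returns (start, end, cost), where start and end are inclusive indices
    into `reference`. The default `dist` is an illustrative assumption.
    """
    n, m = len(query), len(reference)
    if n == 0 or m == 0:
        raise ValueError("query and reference must be non-empty")

    # acc[i][j]: minimal accumulated cost of aligning query[0..i] with a
    # reference span that ends at frame j.
    acc = [[0.0] * m for _ in range(n)]

    # First row is not accumulated along j, so a match may start at any
    # reference frame. This is what distinguishes subsequence DTW from
    # classic DTW, which forces the alignment to start at frame 0.
    for j in range(m):
        acc[0][j] = dist(query[0], reference[j])

    for i in range(1, n):
        acc[i][0] = acc[i - 1][0] + dist(query[i], reference[0])
        for j in range(1, m):
            acc[i][j] = dist(query[i], reference[j]) + min(
                acc[i - 1][j],      # query advances, reference frame repeats
                acc[i][j - 1],      # reference advances, query frame repeats
                acc[i - 1][j - 1],  # both advance
            )

    # The match may also end at any reference frame: take the cheapest
    # entry in the last row.
    end = min(range(m), key=lambda j: acc[n - 1][j])

    # Backtrack to the first row to recover where the match starts.
    i, j = n - 1, end
    while i > 0:
        if j == 0:
            i -= 1
        else:
            _, i, j = min(
                (acc[i - 1][j - 1], i - 1, j - 1),
                (acc[i - 1][j], i - 1, j),
                (acc[i][j - 1], i, j - 1),
            )
    return j, end, acc[n - 1][end]


# Toy usage: the query pattern occurs at reference indices 2..4.
start, end, cost = subsequence_dtw([1, 2, 3], [0, 0, 1, 2, 3, 0, 0])
```

In a keyword spotting setting the frames would typically be acoustic feature vectors rather than scalars, with `dist` replaced by a suitable vector distance. Further matches could be found by selecting additional low-cost end points in the last row while excluding the neighborhood of matches already reported.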
Keywords :
document handling; natural language processing; query processing; TIMIT corpus; algorithm; phone string; query matches; query templates; query terms; spoken documents; spoken keyword spotting; subsequence DTW; subsequence dynamic time warping; user utterances; Algorithm design and analysis; Educational institutions; Heuristic algorithms; Hidden Markov models; Speech; Springs; Time series analysis; Dynamic programming; Dynamic time warping; Spoken keyword spotting; Subsequence matching;
Conference_Title :
Network Infrastructure and Digital Content (IC-NIDC), 2012 3rd IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-2201-0
DOI :
10.1109/ICNIDC.2012.6418819