DocumentCode :
3167586
Title :
Fast spoken query detection using lower-bound Dynamic Time Warping on Graphical Processing Units
Author :
Zhang, Yaodong ; Adl, Kiarash ; Glass, James
Author_Institution :
Comput. Sci. & Artificial Intell. Lab., MIT, Cambridge, MA, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
5173
Lastpage :
5176
Abstract :
In this paper we present a fast unsupervised spoken term detection system based on lower-bound Dynamic Time Warping (DTW) search on Graphical Processing Units (GPUs). The lower-bound estimate and the K nearest neighbor DTW search are carefully designed to fit the GPU parallel computing architecture. In a spoken term detection task on the TIMIT corpus, a 55x speed-up is achieved compared to our previous implementation on a CPU without affecting detection performance. On large, artificially created corpora, measurements show that the total computation time of the entire spoken term detection system grows linearly with corpus size. On average, searching a keyword on a single desktop computer with modern GPUs requires 2.4 seconds/corpus hour.
Keywords :
computational complexity; graphics processing units; parallel architectures; query processing; speech processing; CPU; GPU parallel computing architecture; TIMIT corpus; artificially created corpora; corpus size; fast spoken query detection; fast unsupervised spoken term detection system; graphical processing units; k nearest neighbor DTW search; lower-bound dynamic time warping; single desktop computer; total computation time; Computer architecture; Computers; Graphics processing unit; Instruction sets; Kernel; Parallel processing; Speech; CUDA; GPU; dynamic time warping; spoken term detection
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6289085
Filename :
6289085