Title :
Parallel scalability in speech recognition
Author :
You, Kisun ; Chong, Jike ; Yi, Youngmin ; Gonina, Ekaterina ; Hughes, Christopher J. ; Chen, Yen-Kuang ; Sung, Wonyong ; Keutzer, Kurt
Author_Institution :
Seoul Nat. Univ., Seoul, South Korea
Date :
11/1/2009
Abstract :
We propose four application-level implementation alternatives, called algorithm styles, and construct highly optimized implementations on two parallel platforms: an Intel Core i7 multicore processor and an NVIDIA GTX280 manycore processor. The highest-performing algorithm style varies with the implementation platform. On a 44-min speech data set, we demonstrate substantial speedups of 3.4x on Core i7 and 10.5x on GTX280 compared to a highly optimized sequential implementation on Core i7, without sacrificing accuracy. The parallel implementations contain less than 2.5% sequential overhead, promising scalability and significant potential for further speedup on future platforms.
Keywords :
parallel processing; speech processing; Intel Core i7 multicore processor; NVIDIA GTX280 manycore processor; algorithm styles; application-level implementation; parallel scalability; speech recognition; Application software; Engines; Feature extraction; Inference algorithms; Multicore processing; Scalability; Signal processing algorithms; Space exploration; Speech recognition; Vocabulary
Journal_Title :
Signal Processing Magazine, IEEE
DOI :
10.1109/MSP.2009.934124