DocumentCode
1362883
Title
Parallel scalability in speech recognition
Author
You, Kisun ; Chong, Jike ; Yi, Youngmin ; Gonina, Ekaterina ; Hughes, Christopher J. ; Chen, Yen-Kuang ; Sung, Wonyong ; Keutzer, Kurt
Author_Institution
Seoul Nat. Univ., Seoul, South Korea
Volume
26
Issue
6
fYear
2009
fDate
11/1/2009 12:00:00 AM
Firstpage
124
Lastpage
135
Abstract
We propose four application-level implementation alternatives called algorithm styles and construct highly optimized implementations on two parallel platforms: an Intel Core i7 multicore processor and an NVIDIA GTX280 manycore processor. The highest-performing algorithm style varies with the implementation platform. On a 44-min speech data set, we demonstrate substantial speedups of 3.4x on Core i7 and 10.5x on GTX280 compared to a highly optimized sequential implementation on Core i7, without sacrificing accuracy. The parallel implementations contain less than 2.5% sequential overhead, promising scalability and significant potential for further speedup on future platforms.
Keywords
parallel processing; speech processing; Intel Core i7 multicore processor; NVIDIA GTX280 manycore processor; algorithm styles; application-level implementation; parallel scalability; speech recognition; Application software; Engines; Feature extraction; Inference algorithms; Multicore processing; Scalability; Signal processing algorithms; Space exploration; Speech recognition; Vocabulary;
fLanguage
English
Journal_Title
Signal Processing Magazine, IEEE
Publisher
IEEE
ISSN
1053-5888
Type
jour
DOI
10.1109/MSP.2009.934124
Filename
5230811