• DocumentCode
    1362883
  • Title

    Parallel scalability in speech recognition

  • Author

    You, Kisun ; Chong, Jike ; Yi, Youngmin ; Gonina, Ekaterina ; Hughes, Christopher J. ; Chen, Yen-Kuang ; Sung, Wonyong ; Keutzer, Kurt

  • Author_Institution
    Seoul Nat. Univ., Seoul, South Korea
  • Volume
    26
  • Issue
    6
  • fYear
    2009
  • fDate
    11/1/2009 12:00:00 AM
  • Firstpage
    124
  • Lastpage
    135
  • Abstract
    We propose four application-level implementation alternatives called algorithm styles and construct highly optimized implementations on two parallel platforms: an Intel Core i7 multicore processor and a NVIDIA GTX280 manycore processor. The highest performing algorithm style varies with the implementation platform. On a 44-min speech data set, we demonstrate substantial speedups of 3.4 X on Core i7 and 10.5 X on GTX280 compared to a highly optimized sequential implementation on Core i7 without sacrificing accuracy. The parallel implementations contain less than 2.5% sequential overhead, promising scalability and significant potential for further speedup on future platforms.
  • Keywords
    parallel processing; speech processing; Intel Core i7 multicore processor; NVIDIA GTX280 manycore processor; algorithm styles; application-level implementation; parallel scalability; speech recognition; Application software; Engines; Feature extraction; Inference algorithms; Multicore processing; Scalability; Signal processing algorithms; Space exploration; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Magazine, IEEE
  • Publisher
    ieee
  • ISSN
    1053-5888
  • Type

    jour

  • DOI
    10.1109/MSP.2009.934124
  • Filename
    5230811