• DocumentCode
    1997258
  • Title

    Using MIC to Accelerate a Typical Data-Intensive Application: The Breadth-first Search

  • Author

    Gao Tao ; Lu Yutong ; Suo Guang

  • Author_Institution
    Sch. of Comput., Nat. Univ. of Defense Technol., Changsha, China
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    1117
  • Lastpage
    1125
  • Abstract
    Data-intensive applications draw more and more attentions in the last few years. The breadth-first search (BFS), a typical data-intensive application, is so widely used that the Graph 500 benchmark uses it to rank supercomputers´ performance. The Intel MIC (Many Integrated Core), which is designed for highly parallel computing, hasn´t been fully evaluated for data-intensive applications. In this paper, we discuss how to use MIC to accelerate the BFS. Optimizations both for native mode and for offload mode are discussed. About native mode, we propose optimizations for thread level and data-level parallelism. We exploit the thread-level parallelism by relaxing inter-thread dependence. The optimized algorithm is proved to be more scalable. Data-level parallelism is exploited by 512-bits single instruction multiple data (SIMD) instructions. The maximum speedup we further gain is up to 3.4 times. About offload mode, we present an offload algorithm. By careful task partition and communication optimizations, it can gain speedup for large graphs which can´t run natively on MIC as the limited memory size. We believe that the work is valuable for using MIC to accelerate the BFS. Meanwhile, it´s a general evaluation of the MIC for data-intensive applications.
  • Keywords
    optimisation; parallel machines; parallel processing; tree searching; BFS; Intel MIC; SIMD instruction; breadth-first search; data-level parallelism; inter-thread dependence; many integrated core; parallel computing; single instruction multiple data; supercomputer; thread-level parallelism; typical data-intensive application; Acceleration; Arrays; Microwave integrated circuits; Optimization; Parallel processing; Partitioning algorithms; Vectors; BFS; MIC; data-intensive application; hybrid computing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
  • Conference_Location
    Cambridge, MA
  • Print_ISBN
    978-0-7695-4979-8
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2013.197
  • Filename
    6650997