• DocumentCode
    146646
  • Title

    Performance Optimization of SpMV Using CRS Format by Considering OpenMP Scheduling on CPUs and MIC

  • Author

    Ohshima, Shigetoshi ; Katagiri, Takahiro ; Matsumoto, Morio

  • Author_Institution
    Inf. Technol. Center, Univ. of Tokyo, Tokyo, Japan
  • fYear
    2014
  • fDate
    23-25 Sept. 2014
  • Firstpage
    253
  • Lastpage
    260
  • Abstract
    In this study, we evaluate the performance of sparse matrix-vector multiplication (SpMV) using the compressed row storage (CRS) format on CPUs and MIC. We focus on the relationship between OpenMP scheduling and performance. The performance of SpMV is measured using various OpenMP scheduling settings and the results are analyzed, which show that OpenMP scheduling has a considerable effect on the performance of SpMV. We confirm that some scheduling settings resulted in performance improvements compared with default scheduling for particular matrices. The results of the evaluation show that the performance of SpMV is improved by up to 1.57 times compared with SPARC64 IXfx, 2.47 times compared with Xeon Ivy Bridge-EP, and 2.26 times compared with Knights Corner. Next, we modify the SpMV function of OpenATLib, an auto-tuned numerical library, to consider the scheduling of optimization as an additional SpMV implementation. We measure the performance of the GMRES solver and obtain performance improvements of up to 11.4%. These results will help to improve the performance of various numerical calculation applications.
  • Keywords
    mathematics computing; multiprocessing systems; performance evaluation; processor scheduling; software libraries; sparse matrices; storage management; vectors; CPU; CRS format; GMRES solver; MIC; OpenATLib; OpenMP scheduling; SpMV function; SpMV performance; autotuned numerical library; compressed row storage format; many integrated cores; numerical calculation applications; optimization scheduling; performance improvements; sparse matrix-vector multiplication performance; Dynamic scheduling; Graphics processing units; Hardware; Libraries; Microwave integrated circuits; Schedules; Sparse matrices; Auto-tuning; Open MP; OpenATLib; SpMV; Xabclib;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Embedded Multicore/Manycore SoCs (MCSoc), 2014 IEEE 8th International Symposium on
  • Conference_Location
    Aizu-Wakamatsu
  • Type

    conf

  • DOI
    10.1109/MCSoC.2014.43
  • Filename
    6949479