• DocumentCode
    735871
  • Title

    CUDA-enabled Hadoop cluster for Sparse Matrix Vector Multiplication

  • Author

    Reza, Motahar ; Sinha, Aman ; Nag, Rajkumar ; Mohanty, Prasant

  • Author_Institution
    High Performance Comput. Lab., Nat. Inst. of Sci. & Technol., Berhampur, India
  • fYear
    2015
  • fDate
    9-11 July 2015
  • Firstpage
    169
  • Lastpage
    172
  • Abstract
    Compute Unified Device Architecture (CUDA) is an architecture and programming model that allows leveraging the high compute-intensive processing power of the Graphical Processing Units (GPUs) to perform general, non-graphical tasks in a massively parallel manner. Hadoop is an open-source software framework that has its own file system, the Hadoop Distributed File System (HDFS), and its own programming model, the Map Reduce, in order to accomplish the tasks of storage of very large amount of data and their fast processing in a distributed manner in a cluster of inexpensive hardware. This paper presents a model and implementation of a Hadoop-CUDA Hybrid approach to perform Sparse Matrix Vector Multiplication (SpMV) of very large matrices in a very high performing manner. Hadoop is used for splitting the input matrix into smaller sub-matrices, storing them on individual data nodes and then invoking the required CUDA kernels on the individual GPU-possessing cluster nodes. The original SpMV is done using CUDA. Such an implementation has been seen to improve the performance of the SpMV operation over very large matrices by speedup of around 1.4 in comparison to non-Hadoop, single-GPU CUDA implementation.
  • Keywords
    data handling; graphics processing units; parallel architectures; CUDA-enabled Hadoop cluster; GPU-possessing cluster nodes; Hadoop distributed file system; Hadoop-CUDA hybrid approach; Map Reduce; compute unified device architecture; data nodes; distributed manner; graphical processing units; high compute-intensive processing power; input matrix; massively parallel manner; open-source software framework; programming model; smaller sub-matrices; sparse matrix vector multiplication; File systems; Graphics processing units; Instruction sets; Java; Kernel; Programming; Sparse matrices; CUDA; GPGPU; Hadoop; MapReduce; SCOO; SpMV;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Recent Trends in Information Systems (ReTIS), 2015 IEEE 2nd International Conference on
  • Conference_Location
    Kolkata
  • Type

    conf

  • DOI
    10.1109/ReTIS.2015.7232872
  • Filename
    7232872