مرکز منطقه ای اطلاع رساني علوم و فناوري - CUDA-enabled Hadoop cluster for Sparse Matrix Vector Multiplication

DocumentCode :

735871

Title :

CUDA-enabled Hadoop cluster for Sparse Matrix Vector Multiplication

Author :

Reza, Motahar ; Sinha, Aman ; Nag, Rajkumar ; Mohanty, Prasant

Author_Institution :

High Performance Comput. Lab., Nat. Inst. of Sci. & Technol., Berhampur, India

fYear :

2015

fDate :

9-11 July 2015

Firstpage :

169

Lastpage :

172

Abstract :

Compute Unified Device Architecture (CUDA) is an architecture and programming model that allows leveraging the high compute-intensive processing power of the Graphical Processing Units (GPUs) to perform general, non-graphical tasks in a massively parallel manner. Hadoop is an open-source software framework that has its own file system, the Hadoop Distributed File System (HDFS), and its own programming model, the Map Reduce, in order to accomplish the tasks of storage of very large amount of data and their fast processing in a distributed manner in a cluster of inexpensive hardware. This paper presents a model and implementation of a Hadoop-CUDA Hybrid approach to perform Sparse Matrix Vector Multiplication (SpMV) of very large matrices in a very high performing manner. Hadoop is used for splitting the input matrix into smaller sub-matrices, storing them on individual data nodes and then invoking the required CUDA kernels on the individual GPU-possessing cluster nodes. The original SpMV is done using CUDA. Such an implementation has been seen to improve the performance of the SpMV operation over very large matrices by speedup of around 1.4 in comparison to non-Hadoop, single-GPU CUDA implementation.

Keywords :

data handling; graphics processing units; parallel architectures; CUDA-enabled Hadoop cluster; GPU-possessing cluster nodes; Hadoop distributed file system; Hadoop-CUDA hybrid approach; Map Reduce; compute unified device architecture; data nodes; distributed manner; graphical processing units; high compute-intensive processing power; input matrix; massively parallel manner; open-source software framework; programming model; smaller sub-matrices; sparse matrix vector multiplication; File systems; Graphics processing units; Instruction sets; Java; Kernel; Programming; Sparse matrices; CUDA; GPGPU; Hadoop; MapReduce; SCOO; SpMV;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Recent Trends in Information Systems (ReTIS), 2015 IEEE 2nd International Conference on

Conference_Location :

Kolkata

Type :

conf

DOI :

10.1109/ReTIS.2015.7232872

Filename :

7232872

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=735871