DocumentCode
1855974
Title
A GPU Implementation of Fast Parallel Markov Clustering in Bioinformatics Using EllPACK-R Sparse Data Format
Author
Bustamam, Alhadi ; Burrage, Kevin ; Hamilton, Nicholas A.
Author_Institution
Inst. for Mol. Biosci., Univ. of Queensland, Brisbane, QLD, Australia
fYear
2010
fDate
2-3 Dec. 2010
Firstpage
173
Lastpage
175
Abstract
The massively parallel computing using graphical processing unit (GPU), which based on tens of thousands of parallel threats within hundreds of GPU´s streaming processors, has gained broad popularity and attracted researchers in a wide range of application areas from finance, computer aided engineering, computational fluid dynamics, game physics, numerics, science, medical imaging, life science, and so on, including molecular biology and bioinformatics. Meanwhile, Markov clustering algorithm (MCL) has become one of the most effective and highly cited methods to detect and analyze the communities/clusters within an interaction network dataset on many real world problems such us social, technological, or biological networks including protein-protein interaction networks. However, as the dataset become bigger and bigger, the computation time of MCL algorithm become slower and slower. Hence, GPU computing is an interesting and challenging alternative to attempt to improve the MCL performance. In this poster paper we introduce our improvement of MCL performance based on ELLPACK-R sparse dataset format using GPU computing with the Compute Unified Device Architecture tool (CUDA) from NVIDIA (called CUDA-MCL). As the results show the significant improvement in CUDA-MCL performance and with the low-cost and widely available GPU devices in the market today, this CUDA-MCL implementation is allowing large-scale parallel computation on off-the-shelf desktop machines. Moreover the GPU computing approaches potentially may contribute to significantly change the way bioinformaticians and biologists compute and interact with their data.
Keywords
Markov processes; bioinformatics; computer graphic equipment; coprocessors; parallel architectures; parallel processing; pattern clustering; CUDA-MCL; ELLPACK-R sparse data format; GPU computing; GPU streaming processors; NVIDIA; bioinformatics; compute unified device architecture tool; fast parallel Markov clustering; graphical processing unit; interaction network dataset; large-scale parallel computation; off-the-shelf desktop machines; parallel threats; Bioinformatics; Clustering algorithms; Graphics processing unit; Kernel; Markov processes; Proteins; Sparse matrices; Bioinformatics; CUDA; EllPACK-R; GPU computing; MCl; PPI networks; Parallel Markov clustering;
fLanguage
English
Publisher
ieee
Conference_Titel
Advances in Computing, Control and Telecommunication Technologies (ACT), 2010 Second International Conference on
Conference_Location
Jakarta
Print_ISBN
978-1-4244-8746-2
Electronic_ISBN
978-0-7695-4269-0
Type
conf
DOI
10.1109/ACT.2010.10
Filename
5675816
Link To Document