Title :
A Multi-GPU Hitting Set Algorithm for GRNs Inference
Author :
Carastan-Santos, Danilo ; Yokoingawa De Camargo, Raphael ; Correa Martins, David ; Siang Wun Song ; Silva Rozante, Luiz Carlos ; Ferreira Borelli, Fabrizio
Author_Institution :
Univ. Fed. do ABC, Santo Andre, Brazil
Abstract :
Gene regulatory networks inference is one of the crucial problems of the Systems Biology field. It is still an open problem, mainly because of its high dimensionality (thousands of genes) with a limited number of samples (dozens), making it difficult to estimate dependencies among genes. Besides the estimation problem, another important hindrance is the inherent computational complexity of GRN inference methods. In this work, we focus on circumventing performance issues of a technique based on signal perturbations to infer gene dependencies. One of its main steps consists in solving the Hitting Set problem (HSP), which is NP-Hard. There are many proposals to obtain approximate or exact solutions to this problem. One of these proposals consists of a Graphical Processing Unit (GPU) based algorithm to obtain exact solutions to the HSP. However, such method is not scalable for real size GRNs. We propose an extension of the HSP algorithm to deal with input sets containing thousands of variables by introducing innovations in the data structures and a sorting scheme to allow efficient discarding of Hitting Set non-solution candidates. We provide an implementation for multi-core CPUs and GPU clusters. Our experimental results show that the usage of the sorting scheme brings speedups of up to 3.5 in the CPU implementation. Moreover, using a single GPU, we could obtain an additional speedup of up to 4.7, in comparison with the multithreaded CPU implementation. Finally, usage of eight GPUs from a GPU cluster brought an additional speedup of up to 6.6. Combining all techniques, speedups above 60 were obtained for the parallel part of the algorithm.
Keywords :
computational complexity; graphics processing units; multiprocessing systems; GPU clusters; GRN inference; NP-Hard; gene dependencies; gene regulatory networks inference; graphical processing unit; hitting set problem; multiGPU hitting set algorithm; multicore CPU; signal perturbations; systems biology field; Approximation algorithms; Clustering algorithms; Gene expression; Graphics processing units; Inference algorithms; Silicon; Sorting; CUDA; Cluster of GPUs; GPU Computing; GRNs Inference; Hitting Set;
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on
Conference_Location :
Shenzhen
DOI :
10.1109/CCGrid.2015.29