DocumentCode :
3737934
Title :
Parallel ward clustering for chemical compounds using OpenCL
Author :
Mohamed G. Malhat;Ashraf B. El-Sisi
Author_Institution :
Computer Science dept., Faculty of Computers and Information, Menoufia University, Egypt
fYear :
2015
Firstpage :
23
Lastpage :
27
Abstract :
The availability of chemical libraries with millions of compounds makes the process of identifying lead compounds very hard. The identification of these compounds is the backbone step of drug discovery process. Hierarchical clustering algorithms are used for that purpose. One of the most popular hierarchical clustering algorithms that are used in many applications in the drug discovery process is ward clustering algorithm. A main problem with the previous implementations of ward algorithm is its limitation to handle large data sets within a reasonable time and memory resources. In this paper, OpenCL is used to implement ward algorithm. The first two steps of ward (1) proximity matrix computation; (2) finding minimum distance are modified to run in parallel. Four subsets of National Cancer Institute (NCI) dataset are used. The smallest subset contains 500 compounds and largest subset contains 10,000 compounds. The results show that parallel proximity matrix computation saves 92% of time for smallest subset and 99% of time for largest subset. The parallel minimum distance saves 76% of time for smallest subset and 99% of time for largest subset.
Keywords :
"Compounds","Clustering algorithms","Kernel","Drugs","Signal processing algorithms","Chemicals","Prediction algorithms"
Publisher :
ieee
Conference_Titel :
Computer Engineering & Systems (ICCES), 2015 Tenth International Conference on
Type :
conf
DOI :
10.1109/ICCES.2015.7393011
Filename :
7393011
Link To Document :
بازگشت