DocumentCode
3394465
Title
Distributed Peer-to-Peer Cooperative Partitional-Divisive Clustering for gene expression datasets
Author
Kashef, R. ; Kamel, M.S.
Author_Institution
Dept. of Electr. & Comput. Eng., Univ. of Waterloo, Waterloo, ON
fYear
2008
fDate
15-17 Sept. 2008
Firstpage
143
Lastpage
150
Abstract
Clustering techniques are helpful in understanding gene regulation, cellular processes, and subtypes of cells. A major thrust of gene expression analysis over the last twenty years has been the acquisition of enormous amount of various distributed sources of gene expression datasets. Thus, it is becoming increasingly important to perform clustering of distributed data in-place, without the need to pool it first into a central node. The general goal of distributed clustering is achieving a level of speedup than the centralized approaches. A recent study shows that centralized cooperative clustering outperforms the non-cooperative centralized clustering approaches. In this paper a novel distributed cooperative partitional-divisive clustering in a peer-to-peer network is presented. The distributed CPDC approach is based on intermediate cooperation between the Partitional k-means and the divisive bisecting k-means in a distributed Peer-to-Peer network to produce better global solutions. Computational experiments were conducted to test the performance of the distributed CPDC approach using different gene expression datasets. Undertaken experimental results show that the performance of the distributed CPDC method is better than that of the non-cooperative distributed k-means and distributed bisecting k-means. Thus a new cooperative technique for distributed gene expression repositories is efficiently presented to discover regularities and genes that may span multiple nodes.
Keywords
bioinformatics; cellular biophysics; data mining; genetics; peer-to-peer computing; cellular processes; clustering techniques; distributed cooperative partitional-divisive clustering; gene expression datasets; gene regulation; k-means partition; peer-to-peer network; Clustering algorithms; Computational complexity; Computer architecture; Distributed computing; Gene expression; Machine intelligence; Master-slave; Partitioning algorithms; Pattern analysis; Peer to peer computing;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence in Bioinformatics and Computational Biology, 2008. CIBCB '08. IEEE Symposium on
Conference_Location
Sun Valley, ID
Print_ISBN
978-1-4244-1778-0
Electronic_ISBN
978-1-4244-1779-7
Type
conf
DOI
10.1109/CIBCB.2008.4675771
Filename
4675771
Link To Document