• DocumentCode
    3394465
  • Title

    Distributed Peer-to-Peer Cooperative Partitional-Divisive Clustering for gene expression datasets

  • Author

    Kashef, R. ; Kamel, M.S.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Waterloo, Waterloo, ON
  • fYear
    2008
  • fDate
    15-17 Sept. 2008
  • Firstpage
    143
  • Lastpage
    150
  • Abstract
    Clustering techniques are helpful in understanding gene regulation, cellular processes, and subtypes of cells. A major thrust of gene expression analysis over the last twenty years has been the acquisition of enormous amount of various distributed sources of gene expression datasets. Thus, it is becoming increasingly important to perform clustering of distributed data in-place, without the need to pool it first into a central node. The general goal of distributed clustering is achieving a level of speedup than the centralized approaches. A recent study shows that centralized cooperative clustering outperforms the non-cooperative centralized clustering approaches. In this paper a novel distributed cooperative partitional-divisive clustering in a peer-to-peer network is presented. The distributed CPDC approach is based on intermediate cooperation between the Partitional k-means and the divisive bisecting k-means in a distributed Peer-to-Peer network to produce better global solutions. Computational experiments were conducted to test the performance of the distributed CPDC approach using different gene expression datasets. Undertaken experimental results show that the performance of the distributed CPDC method is better than that of the non-cooperative distributed k-means and distributed bisecting k-means. Thus a new cooperative technique for distributed gene expression repositories is efficiently presented to discover regularities and genes that may span multiple nodes.
  • Keywords
    bioinformatics; cellular biophysics; data mining; genetics; peer-to-peer computing; cellular processes; clustering techniques; distributed cooperative partitional-divisive clustering; gene expression datasets; gene regulation; k-means partition; peer-to-peer network; Clustering algorithms; Computational complexity; Computer architecture; Distributed computing; Gene expression; Machine intelligence; Master-slave; Partitioning algorithms; Pattern analysis; Peer to peer computing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence in Bioinformatics and Computational Biology, 2008. CIBCB '08. IEEE Symposium on
  • Conference_Location
    Sun Valley, ID
  • Print_ISBN
    978-1-4244-1778-0
  • Electronic_ISBN
    978-1-4244-1779-7
  • Type

    conf

  • DOI
    10.1109/CIBCB.2008.4675771
  • Filename
    4675771