• DocumentCode
    2319758
  • Title

    A method for design of data-tailored partitioning algorithms for optimizing the number of clusters in microarray analysis

  • Author

    Vukicevic, Milan ; Delibasic, Boris ; Jovanovic, Milos ; Suknovic, Milija ; Obradovic, Zoran

  • Author_Institution
    Center for Bus. Decision-Making, Univ. of Belgrade, Belgrade, Serbia
  • fYear
    2012
  • fDate
    9-12 May 2012
  • Firstpage
    252
  • Lastpage
    259
  • Abstract
    We propose a method for designing a partitioning clustering algorithm from reusable components that is suitable for finding the appropriate number of clusters (K) in microarray data. The proposed method is evaluated on 10 datasets (4 syntetic and 6 real-word microarrays) by considering 1008 reusable-component-based algorithms and four normalization methods. The best performing algorithm were reported on every dataset and also rules were identified for designing microarray-specific clustering algorithms. The obtained results indicate that in the majority of cases a data-tailored clustering algorithm design outperforms the results reported in the literature. In addition, data normalization can have an important influence on algorithm performance. The method proposed in this paper gives insights for design of divisive clustering algorithms that can reveal the optimal K in a microarray dataset.
  • Keywords
    bioinformatics; biological techniques; data handling; genetics; molecular biophysics; pattern clustering; algorithm performance; cluster number; data normalization; data-tailored clustering algorithm; data-tailored partitioning algorithm; divisive clustering algorithm; microarray analysis; partitioning clustering algorithm; reusable-component-based algorithm; Algorithm design and analysis; Clustering algorithms; Gene expression; Indexes; Partitioning algorithms; Principal component analysis; clustering; microarray data; reusable components;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2012 IEEE Symposium on
  • Conference_Location
    San Diego, CA
  • Print_ISBN
    978-1-4673-1190-8
  • Type

    conf

  • DOI
    10.1109/CIBCB.2012.6217238
  • Filename
    6217238