• DocumentCode
    2496091
  • Title

    A Residue-Based Cluster Validity Index for Gene Expression Data Biclustering

  • Author

    Tsai, Chieh-Yuan ; Chiu, Chuang-Cheng

  • Author_Institution
    Dept. of Ind. Eng. & Manage., Yuan Ze Univ., Chungli, Taiwan
  • fYear
    2009
  • fDate
    11-13 June 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Biclustering consists in simultaneous partitioning of the set of genes and the set of their conditions into biclusters using the gene expression data. In theory, the automated variable weighting K-means clustering algorithm (W-K-means) is proper to conduct the biclustering issue. However, it is critical for the W-K-means algorithm to assign the number of biclusters, K, because the quality of biclustering result highly depends on the parameter setting. In this paper, we proposed a novel residue-based cluster validity index to determine the K value. The residue is an indicator of the coherence degree of its corresponding expression level with respect to remaining expression levels within a bicluster. The evaluation of coherent tendency using residues is easier than that using expression levels, so analyzing the mean squared residue (MSR) model which takes the residue into account is helpful for the biclustering issue. The main concept of our proposed index lies in translating the result of the W-K-means algorithm, including the gene-bicluster membership matrix and the condition-bicluster membership matrix, to match the mean squared residue (MSR) model. Therefore, the appropriate number of biclusters generated by the W-K-means algorithm can be determined based on the MSR model so that the determination result becomes meaningful and reasonable.
  • Keywords
    biology computing; genetics; statistical analysis; W-K-means algorithm; cluster validity index; coherence degree; condition-bicluster membership matrix; data biclustering; gene expression; gene-bicluster membership matrix; mean squared residue; weighting K-means clustering; Clustering algorithms; Clustering methods; Coherence; DNA; Engineering management; Gene expression; Industrial engineering; Partitioning algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedical Engineering , 2009. ICBBE 2009. 3rd International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-2901-1
  • Electronic_ISBN
    978-1-4244-2902-8
  • Type

    conf

  • DOI
    10.1109/ICBBE.2009.5162235
  • Filename
    5162235