• DocumentCode
    2983643
  • Title

    Exclusive Row Biclustering for Gene Expression Using a Combinatorial Auction Approach

  • Author

    Painsky, Amichai ; Rosset, Samuel

  • Author_Institution
    Sch. of Math. Sci., Tel Aviv Univ., Tel Aviv, Israel
  • fYear
    2012
  • fDate
    10-13 Dec. 2012
  • Firstpage
    1056
  • Lastpage
    1061
  • Abstract
    The availability of large microarray data has led to a growing interest in biclustering methods in the past decade. Several algorithms have been proposed to identify subsets of genes and conditions according to different similarity measures and under varying constraints. In this paper we focus on the exclusive row biclusteing problem (also known as projected clustering) for gene expression data sets, in which each row can only be a member of a single bicluster while columns can participate in multiple clusters. This type of biclustering may be adequate, for example, for clustering groups of cancer patients where each patient (row) is expected to be carrying only a single type of cancer, while each cancer type is associated with multiple (and possibly overlapping) genes (columns). In this paper we present a novel method to identify these exclusive row biclusters through a combination of existing biclustering algorithms and combinatorial auction techniques. We devise an approach for tuning the threshold for our algorithm based on comparison to a null model in the spirit of the Gap statistic approach. We demonstrate our approach on both synthetic and real-world gene expression data and show its power in identifying large span non-overlapping rows sub matrices, while considering their unique nature. The Gap statistic approach succeeds in identifying appropriate thresholds in all our examples.
  • Keywords
    biology computing; cancer; combinatorial mathematics; genetics; molecular biophysics; pattern clustering; statistics; Gap statistic approach; biclustering method; cancer type; combinatorial auction approach; exclusive row biclustering; gene expression; microarray data; null model; projected clustering; similarity measure; Algorithm design and analysis; Cancer; Clustering algorithms; Educational institutions; Gene expression; Optimization; Partitioning algorithms; Biclustering; Exclusive Row Biclustering; Gene Expression; Projected Clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2012 IEEE 12th International Conference on
  • Conference_Location
    Brussels
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4673-4649-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2012.25
  • Filename
    6413809