• DocumentCode
    3408535
  • Title

    A mixed factors model for dimension reduction and extraction of a group structure in gene expression data

  • Author

    Yoshida, Ryo ; Higuchi, Tomoyuki ; Imoto, Seiya

  • Author_Institution
    Graduate Univ. for Adv. Studies, Tokyo, Japan
  • fYear
    2004
  • fDate
    16-19 Aug. 2004
  • Firstpage
    161
  • Lastpage
    172
  • Abstract
    When we cluster tissue samples on the basis of genes, the number of observations to be grouped is much smaller than the dimension of feature vector. In such a case, the applicability of conventional model-based clustering is limited since the high dimensionality of feature vector leads to overfilling during the density estimation process. To overcome such difficulty, we attempt a methodological extension of the factor analysis. Our approach enables us not only to prevent from the occurrence of overfilling, but also to handle the issues of clustering, data compression and extracting a set of genes to be relevant to explain the group structure. The potential usefulness are demonstrated with the application to the leukemia dataset.
  • Keywords
    biological tissues; biology computing; blood; cancer; data compression; genetics; physiological models; statistical analysis; cluster analysis; data compression; density estimation process; dimension reduction; factor analysis; feature vector; gene expression data; group structure extraction; leukemia; mixed factors model; tissue samples; Bioinformatics; Data compression; Data mining; Diseases; Eigenvalues and eigenfunctions; Gene expression; Genomics; Humans; Mathematics; Principal component analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
  • Print_ISBN
    0-7695-2194-0
  • Type

    conf

  • DOI
    10.1109/CSB.2004.1332429
  • Filename
    1332429