DocumentCode
3408535
Title
A mixed factors model for dimension reduction and extraction of a group structure in gene expression data
Author
Yoshida, Ryo ; Higuchi, Tomoyuki ; Imoto, Seiya
Author_Institution
Graduate Univ. for Adv. Studies, Tokyo, Japan
fYear
2004
fDate
16-19 Aug. 2004
Firstpage
161
Lastpage
172
Abstract
When we cluster tissue samples on the basis of genes, the number of observations to be grouped is much smaller than the dimension of feature vector. In such a case, the applicability of conventional model-based clustering is limited since the high dimensionality of feature vector leads to overfilling during the density estimation process. To overcome such difficulty, we attempt a methodological extension of the factor analysis. Our approach enables us not only to prevent from the occurrence of overfilling, but also to handle the issues of clustering, data compression and extracting a set of genes to be relevant to explain the group structure. The potential usefulness are demonstrated with the application to the leukemia dataset.
Keywords
biological tissues; biology computing; blood; cancer; data compression; genetics; physiological models; statistical analysis; cluster analysis; data compression; density estimation process; dimension reduction; factor analysis; feature vector; gene expression data; group structure extraction; leukemia; mixed factors model; tissue samples; Bioinformatics; Data compression; Data mining; Diseases; Eigenvalues and eigenfunctions; Gene expression; Genomics; Humans; Mathematics; Principal component analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
Print_ISBN
0-7695-2194-0
Type
conf
DOI
10.1109/CSB.2004.1332429
Filename
1332429
Link To Document