DocumentCode :
2771266
Title :
Unsupervised Gene Selection For High Dimensional Data
Author :
Kim, Young Bun ; Gao, Jean
Author_Institution :
Dept. of Comput. Sci. & Eng., Texas Univ., Arlington, TX
fYear :
2006
fDate :
16-18 Oct. 2006
Firstpage :
227
Lastpage :
234
Abstract :
In this paper, we present a new hybrid approach for unsupervised gene selection. Hybrid approaches try to utilize different evaluation criteria of the filter approaches and wrapper approaches in different search stages. Our method thus uses a two-step approach to identify informative genes. The first step retrieves gene subsets with original physical meaning based on their capacities to reproduce sample projections on principle components by applying the least-square-estimation based evaluation. The second step then searches for the best gene subsets that maximize clustering performance. When applied to a gene expression dataset of leukemia, the method identified a small set of genes whose expression is highly predictive
Keywords :
cancer; genetics; information retrieval; least squares approximations; medical computing; pattern clustering; principal component analysis; tumours; unsupervised learning; clustering performance; filter approaches; gene evaluation criteria; gene expression dataset; gene subsets retrieval; high dimensional data; informative gene identification; least-square-estimation based evaluation; leukemia; principal component analysis; sample projection reproduction; two-step approach; unsupervised gene selection; wrapper approaches; Clustering algorithms; Computer science; Covariance matrix; Data engineering; Filters; Gene expression; Personal communication networks; Principal component analysis; Space exploration; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
BioInformatics and BioEngineering, 2006. BIBE 2006. Sixth IEEE Symposium on
Conference_Location :
Arlington, VA
Print_ISBN :
0-7695-2727-2
Type :
conf
DOI :
10.1109/BIBE.2006.253339
Filename :
4019664
Link To Document :
بازگشت