DocumentCode
2010281
Title
A New Hybrid Approach for Unsupervised Gene Selection
Author
Kim, Young Bun ; Gao, Jean
Author_Institution
Dept. of Comput. Sci. & Eng., Texas Univ., Arlington, TX
fYear
2006
fDate
28-29 Sept. 2006
Firstpage
1
Lastpage
8
Abstract
In recent years, unsupervised gene (feature) selection has become an integral part of microarray analysis because of the large number of genes and complexity in biological systems. Principal components analysis (PCA) is one of the approaches which has been applied, even though principal components (PCs) have no clear physical meanings. In this paper, we present a PCA based feature selection within a wrapper framework called PFSBEM (hybrid PCA based feature selection and boost-expectation-maximization clustering). PFSBEM uses a two-step approach to select features. The first step retrieves feature subsets with original physical meaning based on their capacities to reproduce sample projections on PCs. The second step then searches for the best feature subsets that maximize clustering performance. Experiment results clearly show that our feature sets improve the class prediction with respect to the chosen performance criteria
Keywords
biology computing; expectation-maximisation algorithm; genetics; principal component analysis; biological systems; boost-expectation-maximization clustering; feature selection; microarray analysis; principal components analysis; unsupervised gene selection; Biological systems; Clustering algorithms; Computer science; Filters; Gene expression; Personal communication networks; Principal component analysis; Space exploration; Unsupervised learning; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Bioinformatics and Computational Biology, 2006. CIBCB '06. 2006 IEEE Symposium on
Conference_Location
Toronto, Ont.
Print_ISBN
1-4244-0623-4
Electronic_ISBN
1-4244-0624-2
Type
conf
DOI
10.1109/CIBCB.2006.330996
Filename
4133178
Link To Document