Title :
Improving performance of gene selection by unsupervised learning
Author :
Wang, Mingvi ; Wu, Ping ; Xia, Shunren
Author_Institution :
Coll. of Life Sci., Zhejiang Univ., Hangzhou, China
Abstract :
Selection of significant genes via expression profiles is an important problem in microarray experiments for diseases classification and prediction. Genes of interest are typically selected by a statistical significance test and the top ranked genes were used. A problem with this approach is that many of these genes are highly correlated. For classification purposes it required to have distinct but still highly informative genes. In this paper, we proposed an unsupervised feature selection algorithm to resolve this problem. The method retrieves groups of similar genes by measuring similarity between them whereby redundancy therein is removed. This does not need any search and therefore, is fast. Real biological data experiments have shown that this approach will significantly improve existing classifiers.
Keywords :
biology computing; genetics; pattern classification; statistical analysis; unsupervised learning; biological data experiments; diseases classification; genes; microarray experiments; unsupervised feature selection; unsupervised learning; Biomedical engineering; Biomedical measurements; Clustering algorithms; Diseases; Educational institutions; Filters; Greedy algorithms; Instruments; Testing; Unsupervised learning;
Conference_Titel :
Neural Networks and Signal Processing, 2003. Proceedings of the 2003 International Conference on
Conference_Location :
Nanjing
Print_ISBN :
0-7803-7702-8
DOI :
10.1109/ICNNSP.2003.1279209