DocumentCode :
1163469
Title :
Efficient selection of discriminative genes from microarray gene expression data for cancer diagnosis
Author :
Huang, D. ; Chow, Tommy W S ; Ma, Eden W M ; Li, Jinyan
Author_Institution :
Dept. of Electron. Eng., City Univ. of Hong Kong, Kowloon, China
Volume :
52
Issue :
9
fYear :
2005
Firstpage :
1909
Lastpage :
1918
Abstract :
A new mutual information (MI)-based feature-selection method to solve the so-called large p and small n problem experienced in a microarray gene expression-based data is presented. First, a grid-based feature clustering algorithm is introduced to eliminate redundant features. A huge gene set is then greatly reduced in a very efficient way. As a result, the computational efficiency of the whole feature-selection process is substantially enhanced. Second, MI is directly estimated using quadratic MI together with Parzen window density estimators. This approach is able to deliver reliable results even when only a small pattern set is available. Also, a new MI-based criterion is proposed to avoid the highly redundant selection results in a systematic way. At last, attributed to the direct estimation of MI, the appropriate selected feature subsets can be reasonably determined.
Keywords :
array signal processing; cancer; feature extraction; genetics; medical image processing; pattern clustering; Parzen window density estimator; cancer diagnosis; discriminative genes; feature-selection; grid-based feature clustering; grid-based redundancy elimination; microarray gene expression data; quadratic mutual information; Cancer; Clustering algorithms; Computational efficiency; DNA; Fluorescence; Gene expression; Mutual information; Probes; Redundancy; Support vector machines; Cancer diagnosis; feature selection; grid-based redundancy elimination; microarray gene expression data; quadratic mutual information (QMI);
fLanguage :
English
Journal_Title :
Circuits and Systems I: Regular Papers, IEEE Transactions on
Publisher :
ieee
ISSN :
1549-8328
Type :
jour
DOI :
10.1109/TCSI.2005.852013
Filename :
1506990
Link To Document :
بازگشت