• DocumentCode
    1163469
  • Title

    Efficient selection of discriminative genes from microarray gene expression data for cancer diagnosis

  • Author

    Huang, D. ; Chow, Tommy W S ; Ma, Eden W M ; Li, Jinyan

  • Author_Institution
    Dept. of Electron. Eng., City Univ. of Hong Kong, Kowloon, China
  • Volume
    52
  • Issue
    9
  • fYear
    2005
  • Firstpage
    1909
  • Lastpage
    1918
  • Abstract
    A new mutual information (MI)-based feature-selection method to solve the so-called large p and small n problem experienced in a microarray gene expression-based data is presented. First, a grid-based feature clustering algorithm is introduced to eliminate redundant features. A huge gene set is then greatly reduced in a very efficient way. As a result, the computational efficiency of the whole feature-selection process is substantially enhanced. Second, MI is directly estimated using quadratic MI together with Parzen window density estimators. This approach is able to deliver reliable results even when only a small pattern set is available. Also, a new MI-based criterion is proposed to avoid the highly redundant selection results in a systematic way. At last, attributed to the direct estimation of MI, the appropriate selected feature subsets can be reasonably determined.
  • Keywords
    array signal processing; cancer; feature extraction; genetics; medical image processing; pattern clustering; Parzen window density estimator; cancer diagnosis; discriminative genes; feature-selection; grid-based feature clustering; grid-based redundancy elimination; microarray gene expression data; quadratic mutual information; Cancer; Clustering algorithms; Computational efficiency; DNA; Fluorescence; Gene expression; Mutual information; Probes; Redundancy; Support vector machines; Cancer diagnosis; feature selection; grid-based redundancy elimination; microarray gene expression data; quadratic mutual information (QMI);
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems I: Regular Papers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1549-8328
  • Type

    jour

  • DOI
    10.1109/TCSI.2005.852013
  • Filename
    1506990