• DocumentCode
    632548
  • Title

    NSC-GA: Search for optimal shrinkage thresholds for nearest shrunken centroid

  • Author

    Vinh Quoc Dang ; Chiou-Peng Lam ; Chang Su Lee

  • Author_Institution
    Sch. of Comput. & Security Sci., Edith Cowan Univ., Joondalup, WA, Australia
  • fYear
    2013
  • fDate
    16-19 April 2013
  • Firstpage
    44
  • Lastpage
    51
  • Abstract
    In this paper, a hybrid approach incorporating the Nearest Shrunken Centroid (NSC) and Genetic Algorithm (GA) is proposed to automatically search for an optimal range of shrinkage threshold values for the NSC to improve feature selection and classification accuracy for high dimensional data. The selection of a threshold value is crucial as it is the key factor in the NSC to find significant relative differences between the overall centroid and the class centroid. However, selecting this threshold value via “trial and error” in empirical approaches can be time-consuming and imprecise. In the proposed NSC-GA approach, shrinkage threshold values for the NSC are encoded as genes in chromosomes that are evaluated using a fitness measure obtained from the classifier in the NSC. The proposed approach automatically searches for the optimal threshold for the NSC by utilizing GA. The proposed approach was evaluated using a number of data sets; Alzheimer´s disease, Colon and Leukemia cancer datasets. Experimental results indicated that the proposed approach finds the optimal range of shrinkage thresholds for each dataset, subsequently leading to a higher classification result and involving a smaller number of features when compared to previous studies.
  • Keywords
    bioinformatics; cancer; genetic algorithms; genetics; Alzheimer´s disease; Colon cancer; Genetic Algorithm; Leukemia cancer; NSC-GA approach; chromosomes; classification accuracy; feature selection; genes; high dimensional data; nearest shrunken centroid; optimal shrinkage threshold; Accuracy; Biological cells; Classification algorithms; Genetic algorithms; Sociology; Statistics; Training; Nearest shrunken centroid; classification; feature selection; genetic algorithm; shrinkage thresholds;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2013 IEEE Symposium on
  • Conference_Location
    Singapore
  • Type

    conf

  • DOI
    10.1109/CIBCB.2013.6595387
  • Filename
    6595387