Title :
A Combination of Rough-Based Feature Selection and RBF Neural Network for Classification Using Gene Expression Data
Author :
Jung-Hsien Chiang ; Shing-Hua Ho
Author_Institution :
Nat. Cheng Kung Univ., Tainan
fDate :
3/1/2008 12:00:00 AM
Abstract :
This paper presents a novel rough-based feature selection method for gene expression data analysis. It can find the relevant features without requiring the number of clusters to be known a priori and identify the centers that approximate to the correct ones. In this paper, we attempt to introduce a prediction scheme that combines the rough-based feature selection method with radial basis function neural network. For further consider the effect of different feature selection methods and classifiers on this prediction process, we use the Naive Bayes and linear support vector machine as classifiers, and compare the performance with other feature selection methods, including information gain and principle component analysis. We demonstrate the performance by several published datasets and the results show that our proposed method can achieve high classification accuracy rate.
Keywords :
Bayes methods; feature extraction; genetics; learning (artificial intelligence); medical computing; pattern classification; principal component analysis; radial basis function networks; rough set theory; support vector machines; Bayes classifier; RBF neural network; gene expression data; linear support vector machine; microarray data; principle component analysis; radial basis function; rough-based feature selection; Clustering algorithms; Computer science; Data analysis; Diseases; Filters; Gene expression; Neural networks; Radial basis function networks; Support vector machine classification; Support vector machines; Information gain (IG); microarray data; principle component analysis; radial basis function (RBF) neural network; rough-based feature selection; Algorithms; Diagnosis, Computer-Assisted; Gene Expression Profiling; Humans; Neoplasm Proteins; Neoplasms; Neural Networks (Computer); Oligonucleotide Array Sequence Analysis; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Tumor Markers, Biological;
Journal_Title :
NanoBioscience, IEEE Transactions on
DOI :
10.1109/TNB.2008.2000142