DocumentCode :
562635
Title :
Unsupervised feature selection using Tolerance Rough Set based Relative Reduct
Author :
Inbarani, H. Hannah ; Banu, P. K Nizar
Author_Institution :
Dept. of Comput. Sci., Periyar Univ., Salem, India
fYear :
2012
fDate :
30-31 March 2012
Firstpage :
326
Lastpage :
331
Abstract :
Feature selection is a procedure to select highly informative features. The microarray data comprises less number of samples with more number of genes. Feature selection for gene expression data intends to find a set of genes that best discriminate highly expressed genes from highly suppressed genes. The supervised feature selection methods select features using evaluation function or metric that is related to the decision classes. However, Gene expression datasets are continuous and for such dataset, decision class is not provided. The existing unsupervised feature selection methods are not effective in selecting features which comprises real valued data. Discretizing the original data, results in information loss. An efficient Unsupervised Tolerance Rough Set based Relative Reduct (U-TRS-RelRed) algorithm is presented in this paper. This algorithm uses backward elimination method to remove features from the complete set of original features. K-Means and Rough K-Means algorithms are used to cluster and measure the quality of the reduced data. The proposed approach is compared with existing unsupervised methods and the result demonstrates the efficiency of the proposed algorithm.
Keywords :
biology computing; data handling; feature extraction; pattern clustering; rough set theory; unsupervised learning; U-TRS-RelRed algorithm; backward elimination method; decision classes; evaluation function; evaluation metric; feature removal; gene expression datasets; highly expressed genes; highly suppressed genes; k-means algorithms; microarray data; reduced data quality cluster; reduced data quality measure; rough k-means algorithms; supervised feature selection methods; unsupervised feature selection; unsupervised tolerance rough set based relative reduct algorithm; Algorithm design and analysis; Clustering algorithms; Colon; Noise; Gene Expression data; Relative Reduct; Rough Set; Tolerance Rough Set; Unsupervised Feature Selection; Unsupervised TRS Relative Reduct;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Engineering, Science and Management (ICAESM), 2012 International Conference on
Conference_Location :
Nagapattinam, Tamil Nadu
Print_ISBN :
978-1-4673-0213-5
Type :
conf
Filename :
6215619
Link To Document :
بازگشت