Title :
Feature selection based on rough set and information entropy
Author_Institution :
Dept. of Comput. Sci., California State Univ., Carson, CA, USA
Abstract :
Most existing rough set-based feature selection algorithms suffer from intensive computation of either discernibility functions or positive regions to find attribute reduct In order to improve the efficiency, we develop a new computation model based on relative attribute dependency defined as the proportion of the projection of the decision table on a condition attributes subset to the projection of the decision table on the union of the condition attributes subset and the decision attributes set Two novel algorithms to find optimal reducts of condition attributes based on the relative attribute dependency are implemented using Java 1.4, and are experimented with 10 data sets from UCI machine learning repository. The experiment results demonstrate their usefulness and are analyzed for further research.
Keywords :
data analysis; decision tables; entropy; feature extraction; rough set theory; Java 1.4 version; UCI machine learning repository; data reduction; decision table; feature selection; information entropy; relative attribute dependency; rough set; Computational modeling; Data analysis; Data mining; Information entropy; Java; Machine learning; Machine learning algorithms; Set theory; Statistics; Transaction databases; Rough set theory; data reduction; feature selection; information entropy;
Conference_Titel :
Granular Computing, 2005 IEEE International Conference on
Print_ISBN :
0-7803-9017-2
DOI :
10.1109/GRC.2005.1547256