Title :
A Global Discretization and Attribute Reduction Algorithm Based on K-Means Clustering and Rough Sets Theory
Author_Institution :
Sch. of Manage. & Econ., Beijing Inst. of Technol., Beijing, China
fDate :
Nov. 30 2009-Dec. 1 2009
Abstract :
The knowledge reduction function of rough sets theory is specific on discrete data, while most attributes of decision tables are continuous. Therefore a global discretization and attribute reduction algorithm is proposed based on clustering and rough sets theory. After comparing different discretization methods, the k-means clustering algorithm is used. In order to avoid the shortcomings of k-means clustering algorithm, the F-analysis of variance statistics and support strength of condition attributes are introduced to control the discretization effectiveness. A rational clustering number is derived according to the dependency index to meet the prerequisite of the rough set theory. After that, the attributes are reduced by using rough set theory, and decision rules are induced. Lastly an example is proposed to illustrate the feasibility and effectiveness of the algorithm.
Keywords :
data reduction; decision tables; fuzzy set theory; rough set theory; statistical analysis; F-analysis; attribute reduction algorithm; decision rules; decision tables; dependency index; discrete data; global discretization algorithm; k-means clustering number; knowledge reduction function; rough sets theory; variance statistics; Clustering algorithms; Databases; Frequency conversion; Information systems; Knowledge acquisition; Knowledge management; Rough sets; Set theory; Statistics; Technology management; Attribute reduction; Decision table; Discretization; K-means clustering; Rough set;
Conference_Titel :
Knowledge Acquisition and Modeling, 2009. KAM '09. Second International Symposium on
Conference_Location :
Wuhan
Print_ISBN :
978-0-7695-3888-4
DOI :
10.1109/KAM.2009.16