Title :
The Study on Data Mining Methods Based on Rough Set Theory and CART for Incomplete Data
Author :
Lei Hongyan ; Tian Wanglan ; Zou Hanbin
Author_Institution :
Sch. of Comput. Sci. & Technol., Hunan Univ. of Arts & Sci., Changde, China
Abstract :
Many real-life data sets are incomplete, i.e., some attribute values are missing. Mining incomplete data sets is truly challenging. Among many methods of handling missing attribute values applied in data mining. We will discuss two approaches: rough sets combined with rule induction and the CART system based on surrogate splits. The main objective of this paper is to compare, through experiments, the quality of rough set approaches to missing attribute values with the well-known CART approach. In our experiments we used only lost value interpretation of missing attribute values.
Keywords :
data mining; rough set theory; CART; data mining methods; incomplete data sets; missing attribute values; rough set theory; surrogate splits; Approximation methods; Art; Breast cancer; Data mining; Image segmentation; Iris; Temperature;
Conference_Titel :
Circuits, Communications and System (PACCS), 2011 Third Pacific-Asia Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4577-0855-8
DOI :
10.1109/PACCS.2011.5990231