DocumentCode :
2837607
Title :
The Study on Data Mining Methods Based on Rough Set Theory and CART for Incomplete Data
Author :
Lei Hongyan ; Tian Wanglan ; Zou Hanbin
Author_Institution :
Sch. of Comput. Sci. & Technol., Hunan Univ. of Arts & Sci., Changde, China
fYear :
2011
fDate :
17-18 July 2011
Firstpage :
1
Lastpage :
4
Abstract :
Many real-life data sets are incomplete, i.e., some attribute values are missing. Mining incomplete data sets is truly challenging. Among many methods of handling missing attribute values applied in data mining. We will discuss two approaches: rough sets combined with rule induction and the CART system based on surrogate splits. The main objective of this paper is to compare, through experiments, the quality of rough set approaches to missing attribute values with the well-known CART approach. In our experiments we used only lost value interpretation of missing attribute values.
Keywords :
data mining; rough set theory; CART; data mining methods; incomplete data sets; missing attribute values; rough set theory; surrogate splits; Approximation methods; Art; Breast cancer; Data mining; Image segmentation; Iris; Temperature;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits, Communications and System (PACCS), 2011 Third Pacific-Asia Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4577-0855-8
Type :
conf
DOI :
10.1109/PACCS.2011.5990231
Filename :
5990231
Link To Document :
بازگشت