DocumentCode :
3341887
Title :
Efficient building algorithms of decision tree for uniformly distributed uncertain data
Author :
Chenggang Li ; Liping Huang ; Ling Tian
Author_Institution :
Dept. of Precision Instrum. & Machanology, Univ. of Tsinghua, Beijing, China
Volume :
1
fYear :
2011
fDate :
26-28 July 2011
Firstpage :
105
Lastpage :
108
Abstract :
Developing algorithms for uncertain data is one of the most active themes in data mining community. A number of different decision tree classifiers have been studied in order to deal with uncertain data. This paper extends these works. In this paper, we develop a tree-pruning algorithm using sum of the tuples fractions based on probability theory. By pruning, we find that the accuracy of the classifier is improved and the efficiency of building the decision tree is also improved. Besides, we find that under the context of uniformly distribution, increasing the sampling density of the uncertain attribute value can make little contribution to improve the accuracy, but is computationally more costly. So we propose a new method of sampling. Using this sampling method, the execution time of building the decision tree is greatly decreased.
Keywords :
data mining; decision trees; pattern classification; probability; building algorithm; data mining community; decision tree classifier; probability theory; sampling density; sampling method; tree-pruning algorithm; tuples fraction; uncertain attribute value; uniformly distributed uncertain data; Accuracy; Algorithm design and analysis; Classification algorithms; Decision trees; Distributed databases; Satellites; Training; data mining; decision tree; prunning; uncertain data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Computation (ICNC), 2011 Seventh International Conference on
Conference_Location :
Shanghai
ISSN :
2157-9555
Print_ISBN :
978-1-4244-9950-2
Type :
conf
DOI :
10.1109/ICNC.2011.6022055
Filename :
6022055
Link To Document :
بازگشت