Title :
A Review: The Effects of Imperfect Data on Incremental Decision Tree
Author :
Hang Yang ; Aidong Xu ; Huajun Chen ; Cai Yuan
Author_Institution :
Electr. Power Res. Inst., China Southern Power Grid, Guangzhou, China
Abstract :
Decision tree, as one of the most widely used methods in data mining, has been used in many realistic application. Incremental decision tree handles streaming data scenario that is applicable for big data analysis. However, imperfect data are unavoidable in real-world applications. Studying the state-of-art incremental decision tree induction using Hoeffding bound, we investigated the influence of imperfect data on decision tree model. Additionally we found the imperfect data worsen the performance of decision tree learning, resulting in worse accuracy and more consumed resource. This paper would be good reference for the future research. When thinking of a new generation of incremental decision tree, we should try to overcome the negative effects of imperfect data.
Keywords :
Big Data; data analysis; data mining; decision trees; learning (artificial intelligence); Big Data analysis; Hoeffding bound; data mining; decision tree learning; imperfect data effect; incremental decision tree; streaming data scenario handling; Accuracy; Data mining; Data models; Decision trees; Noise; Noise measurement; Vegetation; classification; data mining; data stream mining; incremental decision tree;
Conference_Titel :
P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC), 2014 Ninth International Conference on
Conference_Location :
Guangdong
DOI :
10.1109/3PGCIC.2014.34